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<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> modif ied_base 
<222> {11) . . . (11) 
<223> I 

<221> modif ied_base 
<222> (14) . . . (14) 
<223> I 

<221> modif ied_base 
<222> (17) . . . (17) 
<223> I 

<223> Degenerate oligonucleotides UPMU used to isolate 
AtMSH3 and AtMSH6 . 
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<400> 1 

ctggatccac nggnccnaay atg 



23 



<210> 2 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 



a » 



<223> Degenerate oligonucleotides DOMU used to isolate 
AtMSH3 and AtMSHS . 

<221> modif ied_base 
<222> (15) , . . (15) 
<223> I 

<221> modif ied_base 
<222> (18) . . . (18) 
<223> I 



<400> 2 

ctggatccrt artgngtnrc raa 

<210> 3 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MSH3 specific primer 636 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 



<400> 3 

tgctagtgcc tcttgcaagc teat 24 

<210> 4 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer API for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia containing adapter 
sequences ligated to both its ends. 



<210> 5 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Primer AP2 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia containing adapter 
sequences ligated to both its ends. 



<300> 
<301> 
<302> 
<303> 
<306> 
<307> 



Reenan and Kolodner 

Genetics 

132 

963-973 
1992 



<400> 4 

ccatcctaat acgactcact atagggc 



27 



T 



<400> 5 

actcactata gggctcgagc ggc 23 

<210> 6 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MSH3 specific primer S525 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 

<400> 6 

aggttctgat tatgtgtgac gctttactta 30 

<210> 7 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MSH3 specific primer S51 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 

<400> 7 

ggatcgggta ctgggttttg agtgtgagg 2 9 

<210> 8 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MSH3 specific primer 635 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 

<400> 8 

gcacgtgctt gatggtgttt tcac 24 

<210> 9 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MSH3 specific primer S523 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 

<400> 9 

tcagacagta tccagcatgg cagaagta 2 8 

<210> 10 
<211> 33 
<212> DNA 

<213> Artificial Sequence 



<220> 



1 



<223> MSH3 specific primer 1S5 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 

<400> 10 

atcccgggat gggcaagcaa aagcagcaga cga 33 

<210> 11 
<211> 27 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> MSH3 specific primer S53 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 

<400> 11 

gacaaagagc gaaatgaggc cccttgg 27 

<210> 12 
<211> 1250 
<212> DMA 

<213> Arabidopsis thaliana ecotype Columbia 
<220> 

<221> misc_f eature 
<222> (0) . . . (0) 
<223> Clone 52 

<400> 12 

cccgggatgg gcaagcaaaa gcagcagacg atttctcgtt tcttcgctcc caaacccaaa 60 
tccccgactc acgaaccgaa tccggtagcc gaatcatcaa caccgccacc gaagatatcc 120 
gccactgtat ccttctctcc ttccaagcgt aagcttctct ccgaccacct cgccgccgcg 180 
tcacccaaaa agcctaaact ttctcctcac actcaaaacc cagtacccga tcccaattta 240 
caccaaagat ttctccagag atttctggaa ccctcgccgg aggaatatgt tcccgaaacg 300 
tcatcatcga ggaaatacac accattggaa cagcaagtgg tggagctaaa gagcaagtac 360 
ccagatgtgg ttttgatggt ggaagttggt tacaggtaca gattcttcgg agaagacgcg 42 0 
gagatcgcag cacgcgtgtt gggtatttac gctcatatgg atcacaattt catgacggcg 480 
agtgtgccaa catttcgatt gaatttccat gtgagaagac tggtgaatgc aggatacaag 540 
attggtgtag tgaagcagac tgaaactgca gccattaagt cccatggtgc aaaccggacc 600 
ggcccttttt tccggggact gtcggcgttg tataccaaag ccacgcttga agcggctgag 660 
gatataagtg gtggttgtgg tggtgaagaa ggttttggtt cacagagtaa tttcttggtt 720 
tgtgttgtgg atgagagagt taagtcggag acattaggct gtggtattga aatgagtttt 780 
gatgttagag tcggtgttgt tggcgttgaa atttcgacag gtgaagttgt ttatgaagag 84 0 
ttcaatgata atttcatgag aagtggatta gaggctgtga ttttgagctt gtcaccagct 900 
gagctgttgc ttggccagcc tctttcacaa caaactgaga agtttttggt ggcacatgct 960 
ggacctacct caaacgttcg agtggaacgt gcctcactgg attgtttcag caatggtaat 1020 
gcagtagatg aggttatttc attatgtgaa aaaatcagcg caggtaactt agaagatgat 10 8 0 
aaagaaatga agctggaggc tgctgaaaaa ggaatgtctt gcttgacagt tcatacaatt 1140 
atgaacatgc cacatctgac tgttcaagcc ctcgccctaa cgttttgcca tctcaaacag 1200 
tttggatttg aaaggatcct ttaccaaggg gcctcatttc gctctttgtc 1250 

<210> 13 
<211> 34 
<212> DNA 

<213> Artificial Sequence 



<223> MSH3 specific primer 2S5 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 

<400> 13 

atcccgggtc aaaatgaaca agttggtttt agtc 34 

<210> 14 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MSH3 specific primer S52 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 

<400> 14 

gccacatctg actgttcaag ccctcgc- 27 

<210> 15 
<211> 2110 
<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 
<220> 

<221> misc__f eature 
<222> (0) . . . (0) 
<223> Clone 13 

<400> 15 

gccacatctg actgttcaag ccctcgccct aacgttttgc catctcaaac agtttggatt 60 
tgaaaggatc ctttaccaag gggcctcatt tcgctctttg tcaagtaaca cagagatgac 120 
tctctcagcc aatactctgc aacagttgga ggttgtgaaa aataattcag atggatcgga 180 
atctggctcc ttattccata atatgaatca cacacttaca gtatatggtt ccaggcttct 240 
tagacactgg gtgactcatc ctctatgcga tagaaatttg atatctgctc ggcttgatgc 300 
tgtttctgag atttctgctt gcatgggatc tcatagttct tcccagctca gcagtgagtt 360 
ggttgaagaa ggttctgaga gagcaattgt atcacctgag ttttatctcg tgctctcctc 420 
agtcttgaca gctatgtcta gatcatctga tattcaacgt ggaataacaa gaatctttca 480 
tcggactgct aaagccacag agttcattgc agttatggaa gctattttac ttgcggggaa 540 
gcaaattcag cggcttggca taaagcaaga ctctgaaatg aggagtatgc aatctgcaac 600 
tgtgcgatct actcttttga gaaaattgat ttctgttatt tcatcccctg ttgtggttga 660 
caatgccgga aaacttctct ctgccctaaa taaggaagcg gctgttcgag gtgacttgct 720 
cgacatacta atcacttcca gcgaccaatt tcctgagctt gctgaagctc gccaagcagt 780 
tttagtcatc agggaaaagc tggattcctc gatagcttca tttcgcaaga agctcgctat 840 
tcgaaatttg gaatttcttc aagtgtcggg gatcacacat ttgatagagc tgcccgttga 900 
ttccaaggtc cctatgaatt gggtgaaagt aaatagcacc aagaagacta ttcgatatca 960 
tcccccagaa atagtagctg gcttggatga gctagctcta gcaactgaac atcttgccat 1020 
tgtgaaccga gcttcgtggg atagtttcct caagagtttc agtagatact acacagattt 1080 
taaggctgcc gttcaagctc ttgctgcact ggactgtttg cactcccttt caactctatc 1140 
tagaaacaag aactatgtcc gtcccgagtt tgtggatgac tgtgaaccag ttgagataaa 12 00 
catacagtct ggtcgtcatc ctgtactgga gactatatta caagataact tcgtcccaaa 1260 
tgacacaatt ttgcatgcag aaggggaata ttgccaaatt atcaccggac ctaacatggg 1320 
aggaaagagc tgctatatcc gtcaagttgc tttaatttcc ataatggctc aggttggttc 1380 
ctttgtacca gcgtcattcg ccaagctgca cgtgcttgat ggtgttttca ctcggatggg 1440 
tgcttcagac agtatccagc atggcagaag tacctttcta gaagaattaa gtgaagcgtc 1500 
acacataatc agaacctgtt cttctcgttc gcttgttata ttagatgagc ttggaagagg 1560 
cactagcaca cacgacggtg tagccattgc ctatgcaaca ttacagcatc tcctagcaga 1620 
aaagagatgt ttggttcttt ttgtcacgca ttaccctgaa atagctgaga tcagtaacgg 1680 



attcccaggt tctgttggga cataccatgt ctcgtatctg acattgcaga aggataaagg 1740 
cagttatgat catgatgatg tgacctacct atataagctt gtgcgtggtc tttgcagcag 1800 
gagctttggt tttaaggttg ctcagcttgc ccagatacct ccatcatgta tacgtcgagc 1860 
catttcaatg gctgcaaaat tggaagctga ggtacgtgca agagagagaa atacacgcat 1920 
gggagaacca gaaggacatg aagaaccgag aggcgcagaa gaatctattt cggctctagg 1980 
tgacttgttt gcagacctga aatttgctct ctctgaagag gacccttgga aagcattcga 2040 
gtttttaaag catgcttgga agattgctgg caaaatcaga ctaaaaccaa cttgttcatt 2100 
ttgacccggg 2110 

<210> 16 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MSH3 specific primer S51 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 

<400> 16 

ggatcgggta ctgggttttg agtgtgagg 29 

<210> 17 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MSH3 specific primer S525 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 

<400> 17 

aggttctgat tatgtgtgac gctttactta 30 

<210> 18 
<211> 3522 
<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 

<220> 
<221> CDS 

<222> (100) . . . (3342) 

<223> AtMSH3 full-length cDNA and deduced sequence of 
the encoded polypeptide. 

<400> 18 

cctaagaaag cgcgcgaaaa ttggcaaccc aagttcgcca tagccacgac cacgaccttc 60 
catttctctt aaacggagga gattacgaat aaagcaatt atg ggc aag caa aag 114 

Met Gly Lys Gin Lys 
1 5 

cag cag acg att tct cgt ttc ttc get ccc aaa ccc aaa tec ccg act 162 
Gin Gin Thr lie Ser Arg Phe Phe Ala Pro Lys Pro Lys Ser Pro Thr 
10 15 20 



cac gaa ccg aat ccg gta gec gaa tea tea aca ccg cca ccg aag ata 210 
His Glu Pro Asn Pro Val Ala Glu Ser Ser Thr Pro Pro Pro Lys He 
25 30 35 



tec gec act gta tec ttc tct cct tec aag cgt aag ctt etc tec gac 258 
Ser Ala Thr Val Ser Phe Ser Pro Ser Lys Arg Lys Leu Leu Ser Asp 
40 45 50 



cac etc gec gec gcg tea ccc aaa aag cct aaa ctt tct cct cac act 
His Leu Ala Ala Ala Ser Pro Lys Lys Pro Lys Leu Ser Pro His Thr 
55 60 65 



306 



caa aac cca gta ccc gat ccc aat tta cac caa aga ttt etc cag aga 354 
Gin Asn Pro Val Pro Asp Pro Asn Leu His Gin Arg Phe Leu Gin Arg 
70 75 80 - 85 

ttt ctg gaa ccc teg ccg gag gaa tat gtt ccc gaa acg tea tea teg 402 
Phe Leu Glu Pro Ser Pro Glu Glu Tyr Val Pro Glu Thr Ser Ser Ser 
90 95 100 

agg aaa tac aca cca ttg gaa cag caa gtg gtg gag eta aag age aag 4 50 
Arg Lys Tyr Thr Pro Leu Glu Gin Gin Val Val Glu Leu Lys Ser Lys 
105 110 115 

tac cca gat gtg gtt ttg atg gtg gaa gtt ggt tac agg tac aga ttc 498 
Tyr Pro Asp Val Val Leu Met Val Glu Val Gly Tyr Arg Tyr Arg Phe 
120 125 130 

ttc gga gaa gac gcg gag ate gca gca cgc gtg ttg ggt att tac get 546 
Phe Gly Glu Asp Ala Glu lie Ala Ala Arg Val Leu Gly He Tyr Ala 
135 140 145 

cat atg gat cac aat ttc atg acg gcg agt gtg cca aca ttt cga ttg 594 
His Met Asp His Asn Phe Met Thr Ala Ser Val Pro Thr Phe Arg Leu 
150 155 160 165 

aat ttc cat gtg aga aga ctg gtg aat gca gga tac aag att ggt gta 642 
Asn Phe His Val Arg Arg Leu Val Asn Ala Gly Tyr Lys He Gly Val 
170 175 180 

gtg aag cag act gaa act gca gec att aag tec cat ggt gca aac egg 690 
Val Lys Gin Thr Glu Thr Ala Ala He Lys Ser His Gly Ala Asn Arg 
185 190 195 

ace ggc cct ttt ttc egg gga ctg teg gcg ttg tat ace aaa gee acg 738 
Thr Gly Pro Phe Phe Arg Gly Leu Ser Ala Leu Tyr Thr Lys Ala Thr 
200 205 210 

ctt gaa gcg get gag gat ata agt ggt ggt tgt ggt ggt gaa gaa ggt 786 
Leu Glu Ala Ala Glu Asp He Ser Gly Gly Cys Gly Gly Glu Glu Gly 
215 220 225 

ttt ggt tea cag agt aat ttc ttg gtt tgt gtt gtg gat gag aga gtt 834 
Phe Gly Ser Gin Ser Asn Phe Leu Val Cys Val Val Asp Glu Arg Val 
230 235 240 245 

aag teg gag aca tta ggc tgt ggt att gaa atg agt ttt gat gtt aga 882 
Lys Ser Glu Thr Leu Gly Cys Gly He Glu Met Ser Phe Asp Val Arg 
250 255 260 



t> 



gtc ggt gtt gtt ggc gtt gaa att teg aca ggt gaa gtt gtt tat gaa 930 
Val Gly Val Val Gly Val Glu He Ser Thr Gly Glu Val Val Tyr Glu 
265 270 275 

gag ttc aat gat aat ttc atg aga agt gga tta gag get gtg att ttg 978 
Glu Phe Asn Asp Asn Phe Met Arg Ser Gly Leu Glu Ala Val He Leu 
280 285 290 

age ttg tea cca get gag ctg ttg ctt ggc cag cct ctt tea caa caa 1026 
Ser Leu Ser Pro Ala Glu Leu Leu Leu Gly Gin Pro Leu Ser Gin Gin 
295 300 305 

act gag aag ttt ttg gtg gca cat get gga cct acc tea aac gtt cga 1074 
Thr Glu Lys Phe Leu Val Ala His Ala Gly Pro Thr Ser Asn Val Arg 
310 315 320 325 

gtg gaa cgt gec tea ctg gat tgt ttc age aat ggt aat gca gta gat 1122 
Val Glu Arg Ala Ser Leu Asp Cys Phe Ser Asn Gly Asn Ala Val Asp 
330 335 340 

gag gtt att tea tta tgt gaa aaa ate age gca ggt aac tta gaa gat 1170 
Glu Val He Ser Leu Cys Glu Lys He Ser Ala Gly Asn Leu Glu Asp 
345 350 355 

gat aaa gaa atg aag ctg gag get get gaa aaa gga atg tct tgc ttg 1218 
Asp Lys Glu Met Lys Leu Glu Ala Ala Glu Lys Gly Met Ser Cys Leu 
360 365 370 

aca gtt cat aca att atg aac atg cca cat ctg act gtt caa gee etc 1266 
Thr Val His Thr He Met Asn Met Pro His Leu Thr Val Gin Ala Leu 
375 380 385 

gee eta acg ttt tgc cat etc aaa cag ttt gga ttt gaa agg ate ctt 1314 
Ala Leu Thr Phe Cys His Leu Lys Gin Phe Gly Phe Glu Arg He Leu 
390 395 400 405 

tac caa ggg gec tea ttt cgc tct ttg tea agt aac aca gag atg act 1362 
Tyr Gin Gly Ala Ser Phe Arg Ser Leu Ser Ser Asn Thr Glu Met Thr 
410 415 420 

etc tea gec aat act ctg caa cag ttg gag gtt gtg aaa aat aat tea 1410 
Leu Ser Ala Asn Thr Leu Gin Gin Leu Glu Val Val Lys Asn Asn Ser 
425 430 435 

gat gga teg gaa tct ggc tec tta ttc cat aat atg aat cac aca ctt 1458 
Asp Gly Ser Glu Ser Gly Ser Leu Phe His Asn Met Asn His Thr Leu 
440 445 450 

aca gta tat get tec agg ctt ctt aga cac tgg gtg act cat cct eta 1506 
Thr Val Tyr Ala Ser Arg Leu Leu Arg His Trp Val Thr His Pro Leu 
455 460 465 

tgc gat aga aat ttg ata tct get egg ctt gat get gtt tct gag att 1554 
Cys Asp Arg Asn Leu He Ser Ala Arg Leu Asp Ala Val Ser Glu He 
470 475 480 485 

tct get tgc atg gga tct cat agt tct tec cag etc age agt gag ttg 1602 



Ser Ala Cys Met Gly Ser His Ser Ser Ser Gin Leu Ser Ser Glu Leu 
490 495 500 



gtt gaa gaa ggt tct gag aga gca att gta tea cct gag ttt tat etc 1650 
Val Glu Glu Gly Ser Glu Arg Ala lie Val Ser Pro Glu Phe Tyr Leu 
505 510 515 



gtg etc tec tea gtc ttg aca get atg tct aga tea tct gat att caa 
Val Leu Ser Ser Val Leu Thr Ala Met Ser Arg Ser Ser Asp lie Gin 
520 525 530 



1698 



cgt gga ata aca aga ate ttt cat egg act get aaa gec aca gag ttc 1746 
Arg Gly He Thr Arg He Phe His Arg Thr Ala Lys Ala Thr Glu Phe 
535 540 545 

att gca gtt atg gaa get att tta ctt gcg ggg aag caa att cag egg 1794 
He Ala Val Met Glu Ala He Leu Leu Ala Gly Lys Gin He Gin Arg 
550 555 560 565 

ctt ggc ata aag caa gac tct gaa atg agg agt atg caa tct gca act 1842 
Leu Gly He Lys Gin Asp Ser Glu Met Arg Ser Met Gin Ser Ala Thr 
570 575 580 

gtg cga tct act ctt ttg aga aaa ttg att tct gtt att tea tec cct 1890 
Val Arg Ser Thr Leu Leu Arg Lys Leu He Ser Val He Ser Ser Pro 
585 590 595 

gtt gtg gtt gac aat gec gga aaa ctt etc tct gec eta aat aag gaa 193 8 
Val Val Val Asp Asn Ala Gly Lys Leu Leu Ser Ala Leu Asn Lys Glu 
600 605 610 

gcg get gtt cga ggt gac ttg etc gac ata eta ate act tec age gac 1986 
Ala Ala Val Arg Gly Asp Leu Leu Asp He Leu He Thr Ser Ser Asp 
615 620 625 

caa ttt cct gag ctt get gaa get cgc caa gca gtt tta gtc ate agg 2034 
Gin Phe Pro Glu Leu Ala Glu Ala Arg Gin Ala Val Leu Val He Arg 
630 635 640 645 

gaa aag ctg gat tec teg ata get tea ttt cgc aag aag etc get att 2082 
Glu Lys Leu Asp Ser Ser He Ala Ser Phe Arg Lys Lys Leu Ala He 
650 655 660 

cga aat ttg gaa ttt ctt caa gtg teg ggg ate aca cat ttg ata gag 2130 
Arg Asn Leu Glu Phe Leu Gin Val Ser Gly He Thr His Leu He Glu 
665 670 675 

ctg ccc gtt gat tec aag gtc cct atg aat tgg gtg aaa gta aat age 2178 
Leu Pro Val Asp Ser Lys Val Pro Met Asn Trp Val Lys Val Asn Ser 
680 685 690 

acc aag aag act att cga tat cat ccc cca gaa ata gta get ggc ttg 2226 
Thr Lys Lys Thr He Arg Tyr His Pro Pro Glu He Val Ala Gly Leu 
695 700 705 

gat gag eta get eta gca act gaa cat ctt gec att gtg aac cga get 2274 
Asp Glu Leu Ala Leu Ala Thr Glu His Leu Ala He Val Asn Arg Ala 



710 

teg tgg gat 
Ser Trp Asp 



aag get gec 
Lys Ala Ala 



tea act eta 
Ser Thr Leu 
760 

gac tgt gaa 
Asp Cys Glu 
775 

ctg gag act 
Leu Glu Thr 
790 

cat gca gaa 
His Ala Glu 



gga aag age 
Gly Lys Ser 



cag gtt ggt 
Gin Val Gly 
840 

gat ggt gtt 
Asp Gly Val 
855 

aga agt acc 
Arg Ser Thr 
870 

acc tgt tct 
Thr Cys Ser 



act age aca 
Thr Ser Thr 



etc eta gca 
Leu Leu Ala 
920 

gaa ata get 
Glu He Ala 
935 



715 

agt ttc etc 
Ser Phe Leu 
730 

gtt caa get 
Val Gin Ala 
745 

tct aga aac 
Ser Arg Asn 



cca gtt gag 
Pro Val Glu 



ata tta caa 
He Leu Gin 
795 

ggg gaa tat 
Gly Glu Tyr 
810 

tgc tat ate 
Cys Tyr He 
825 

tec ttt gta 
Ser Phe Val 



ttc act egg 
Phe Thr Arg 



ttt eta gaa 
Phe Leu Glu 
875 

tct cgt teg 
Ser Arg Ser 
890 

cac gac ggt 
His Asp Gly 
905 

gaa aag aga 
Glu Lys Arg 



gag ate agt 
Glu He Ser 



aag agt ttc 
Lys Ser Phe 



ctt get gca 
Leu Ala Ala 
750 

aag aac tat 
Lys Asn Tyr 
765 

ata aac ata 
He Asn He 
780 

gat aac ttc 
Asp Asn Phe 



tgc caa att 
Cys Gin He 



cgt caa gtt 
Arg Gin Val 
830 

cca gcg tea 
Pro Ala Ser 
845 

atg ggt get 
Met Gly Ala 
860 

gaa tta agt 
Glu Leu Ser 



ctt gtt ata 
Leu Val He 



gta gee att 
Val Ala He 
910 

tgt ttg gtt 
Cys Leu Val 
925 

aac gga ttc 
Asn Gly Phe 
940 



720 

agt aga tac 
Ser Arg Tyr 
735 

ctg gac tgt 
Leu Asp Cys 



gtc cgt ccc 
Val Arg Pro 



cag tct ggt 
Gin Ser Gly 
785 

gtc cca aat 
Val Pro Asn 
800 

ate acc gga 
He Thr Gly 
815 

get tta att 
Ala Leu He 



ttc gee aag 
Phe Ala Lys 



tea gac agt 
Ser Asp Ser 
865 

gaa gcg tea 
Glu Ala Ser 
880 

tta gat gag 
Leu Asp Glu 
895 

gee tat gca 
Ala Tyr Ala 



ctt ttt gtc 
Leu Phe Val 



cca ggt tct 
Pro Gly Ser 
945 



tac aca gat 
Tyr Thr Asp 
740 

ttg cac tec 
Leu His Ser 
755 

gag ttt gtg 
Glu Phe Val 
770 

cgt cat cct 
Arg His Pro 



gac aca att 
Asp Thr He 



cct aac atg 
Pro Asn Met 
820 

tec ata atg 
Ser He Met 
835 

ctg cac gtg 
Leu His Val 
850 

ate cag cat 
He Gin His 



cac ata ate 
His He He 



ctt gga aga 
Leu Gly Arg 
900 

aca tta cag 
Thr Leu Gin 
915 

acg cat tac 
Thr His Tyr 
930 

gtt ggg aca 
Val Gly Thr 



725 

ttt 2322 
Phe 



ctt 2370 
Leu 



gat 2418 
Asp 



gta 2466 
Val 



ttg 2514 

Leu 

805 

gga 2562 
Gly 



get 2610 
Ala 



ctt 2658 
Leu 



ggc 2706 
Gly 



aga 2 754 

Arg 

885 

ggc 2 802 
Gly 



cat 2850 
His 



cct 2898 
Pro 



tac 2946 
Tyr 



cat gtc teg tat ctg aca ttg cag aag gat aaa ggc agt tat gat cat 2 9 94 
His Val Ser Tyr Leu Thr Leu Gin Lys Asp Lys Gly Ser Tyr Asp His 
950 955 960 965 

gat gat gtg acc tac eta tat aag ctt gtg cgt ggt ctt tgc age agg 3042 
Asp Asp Val Thr Tyr Leu Tyr Lys Leu Val Arg Gly Leu Cys Ser Arg 
970 975 980 

age ttt ggt ttt aag gtt get cag ctt gec cag ata cct cca tea tgt 3090 
Ser Phe Gly Phe Lys Val Ala Gin Leu Ala Gin He Pro Pro Ser Cys 
985 990 995 

ata cgt cga gee att tea atg get gca aaa ttg gaa get gag gta cgt 313 8 
He Arg Arg Ala He Ser Met Ala Ala Lys Leu Glu Ala Glu Val Arg 
1000 1005 1010 

gca aga gag aga aat aca cgc atg gga gaa cca gaa gga cat gaa gaa 3186 
Ala Arg Glu Arg Asn Thr Arg Met Gly Glu Pro Glu Gly His Glu Glu 
1015 1020 1025 

ccg aga ggc gca gaa gaa tct att teg get eta ggt gac ttg ttt gca 3234 
Pro Arg Gly Ala Glu Glu Ser He Ser Ala Leu Gly Asp Leu Phe Ala 
1030 1035 1040 1045 

gac ctg aaa ttt get etc tct gaa gag gac cct tgg aaa gca ttc gag 3282 
Asp Leu Lys Phe Ala Leu Ser Glu Glu Asp Pro Trp Lys Ala Phe Glu 
1050 1055 1060 

ttt tta aag cat get tgg aag att get ggc aaa ate aga eta aaa cca 333 0 
Phe Leu Lys His Ala Trp Lys He Ala Gly Lys He Arg Leu Lys Pro 
1065 1070 1075 

act tgt tea ttt tgatttaatc ttaacattat ageaactgea aggtcttgat 3382 
Thr Cys Ser Phe 
1080 

catctgttag ttgegtacta acttatgtgt attagtataa caagaaaaga gaattagaga 3442 
gatggattct aatccggtgt tgcagtacat cttttctcca cccgcataaa aaaaaaaaaa 3502 
aaaaaaaaaa aaaaaaaaaa 3 522 

<210> 19 
<211> 1081 
<212> PRT 

<213> Arabidopsis thaliana ecotype Columbia 
<220> 

<223> Polypeptide MSH3 
<400> 19 

Met Gly Lys Gin Lys Gin Gin Thr He Ser Arg Phe Phe Ala Pro Lys 

15 10 15 

Pro Lys Ser Pro Thr His Glu Pro Asn Pro Val Ala Glu Ser Ser Thr 

20 25 30 

Pro Pro Pro Lys He Ser Ala Thr Val Ser Phe Ser Pro Ser Lys Arg 

35 40 45 

Lys Leu Leu Ser Asp His Leu Ala Ala Ala Ser Pro Lys Lys Pro Lys 





50 






Leu 


Ser 


Pro 


His 


65 








Arg 


Phe 


Leu 


Gin 


Glu 


Thr 


Ser 


Ser 








100 


Glu 


Leu 


Lys 


Ser 






115 




Tyr 


Arg 


Tyr Arg 




i i r\ 

130 






Leu 


Gly 


11B 


Tyr 


1 A C 

14b 








Pro 


Thr 


Pxie 


Arg 


Tyr 


Lys 


lie 


Gly 








180 


His 


Gly 


Ala 


Asn 






IOC 

lyb 




Tyr 


Thr 


Lys 


Ala 




lift 
210 






Gly 


Gly 


GlU 


GlU 


225 








Val 


Asp 


Glu 


Arg 


Ser 


Phe 


Asp 


Val 








260 


Glu 


Val 


Val 


Tyr 






275 




Glu 


Ala 


Val 


lie 




O f\ 

290 






Pro 


Leu 


Ser 


Gin 


"3 A C 








Thr 


Ser 


Asn 


Val 


Gly 


Asn 


Ala 


Val 








340 


Gly 


Asn 


Leu 


Glu 






355 




Gly 


Met 


Ser 


Cys 




370 






Thr 


Val 


Gin 


Ala 


385 








Phe 


Glu 


Arg 


lie 


Asn 


Thr 


Glu 


Met 








420 


Val 


Lys 


Asn 


Asn 






435 




Met 


Asn 


His 


Thr 




450 






Val 


Thr 


His 


Pro 


465 








Ala 


Val 


Ser 


Glu 


Leu 


Ser 


Ser 


Glu 



500 



55 

Thr Gin Asn Pro 
70 

Arg Phe Leu Glu 
85 

Ser Arg Lys Tyr 

Lys Tyr Pro Asp 
120 

Phe Phe Gly Glu 
135 

Ala His Met Asp 
150 

Leu Asn Phe His 
165 

Val Val Lys Gin 

Arg Thr Gly Pro 
200 

Thr Leu Glu Ala 
215 

Gly Phe Gly Ser 
230 

Val Lys Ser Glu 
245 

Arg Val Gly Val 

Glu Glu Phe Asn 
280 

Leu Ser Leu Ser 
295 

Gin Thr Glu Lys 
310 

Arg Val Glu Arg 
325 

Asp Glu Val He 

Asp Asp Lys Glu 
360 

Leu Thr Val His 
375 

Leu Ala Leu Thr 
390 

Leu Tyr Gin Gly 
405 

Thr Leu Ser Ala 

Ser Asp Gly Ser 
440 

Leu Thr Val Tyr 
455 

Leu Cys Asp Arg 
470 

He Ser Ala Cys 
485 

Leu Val Glu Glu 



60 

Val Pro Asp Pro 
75 

Pro Ser Pro Glu 
90 

Thr Pro Leu Glu 
105 

Val Val Leu Met 

Asp Ala Glu He 
140 

His Asn Phe Met 
155 

Val Arg Arg Leu 
170 

Thr Glu Thr Ala 
185 

Phe Phe Arg Gly 

Ala Glu Asp He 
220 

Gin Ser Asn Phe 
235 

Thr Leu Gly Cys 
250 

Val Gly Val Glu 
265 

Asp Asn Phe Met 

Pro Ala Glu Leu 
300 

Phe Leu Val Ala 
315 

Ala Ser Leu Asp 
330 

Ser Leu Cys Glu 
345 

Met Lys Leu Glu 

Thr He Met Asn 
380 

Phe Cys His Leu 
395 

Ala Ser Phe Arg 
410 

Asn Thr Leu Gin 
425 

Glu Ser Gly Ser 

Gly Ser Arg Leu 
460 

Asn Leu He Ser 
475 

Met Gly Ser His 
490 

Gly Ser Glu Arg 
505 



Asn 


Leu 


His 


Gin 








80 


Glu 


Tvr 


Val 


Pro 






95 




Gin 


Gin 


Val 


Val 




110 






Val 


Glu 


Val 


Glv 


125 








Ala 


Ala 


Arcr 


Val 


Thr 


Ala 


Ser 


Val 








160 


Val 


Asn 


Ala 


Gly 






175 




Ala 


He 


Lys 


Ser 




190 






Leu 


Ser 


Ala 


Leu 


205 








Ser 


Glv 


Gly Cys 


Leu 


Val 


Cys 


Val 








240 


Glv 


He 


Glu 


Met 






255 




He 


Ser 


Thr 


Gly 




270 






Arcr 


Ser 


Gly Leu 


£i Q J 








Leu 


Leu 


Gly Gin 


Met 


Ala 


Gly 


Pro 








320 


Cys 


Phe 


Ser 


Asn 






335 




-i—i y id 


He 


Ser 


Ala 




J JU 






Ala 


Ala 


Glu 


Lys 


3fiS 








Mpf 
l IC u 


Pro 


His 


Leu 


Lys 


Gin 


Phe 


Gly 








400 


Ser 


Leu 


Ser 


Ser 






415 




m n 

<J_l.ll 




Glu 


Val 




4 t n 

T J u 






J-icU 


flic 


His 


Asn 


A A C 

445 








Leu 


Arg 


His 


Trp 


Ala 


Arg 


Leu 


Asp 








480 


Ser 


Ser 


Ser 


Gin 






495 




Ala 


He 


Val 


Ser 




510 







Pro 


Glu 


Pne 


Tyr 


Leu 


Val 


Leu 


Ser 


0 ^ 
Ser 


Val 


Leu 


rr1"U v 

Thr 


Aia 


Met 


Ser Arg 






515 










rr A A 

52 0 










b2 b 








Ser 


Ser 


Asp 


He 


Gin 


Arg 


Gly 


He 


fry l_ _^ 

Thr 


Arg 


lie 


Phe 


HIS 


Arg 


Thr 


Ala 




530 










r rr 

535 










C A A 

b4u 










Lys 


Ala 


Thr 


Glu 


Pne 


He 


Ala 


Val 


Met 


GlU 


71 -1 _ 
Ala 


lie 


Leu 


Leu 


Ala Gly 


545 










CCA 

550 










bbb 










O O VJ 


Lys 


Gin 


He 


Gin 


Arg 


Leu 


Gly 


-T- ~J _ 

He 


Lys 


Gin 


Asp 


Ser 


GIU 


Met 


rtX y 


OCX 










565 










C T A 

b /0 










D / D 




Met 


Gin 


Ser 


Ala 


Thr 


17. T 

Val 


Arg 


Ser 


Thr 


Leu 


Leu 


Arg 


Lys 


T mil 

Leu 


Tie 
lie 


OCl 








f~ 0 r\ 

580 










CDC 

bob 










can 
b y u 






Val 


TP T — 

He 


Ser 


Ser 


Pro 


Val 


Val 


val 


Asp 


Asn 


Ala 


Gly 


Lys 


Leu 


UC Li 








595 










ez A n 










bUb 








Ala 


Leu 


Asn 


Lys 


CjlU 


Ala 


Ala 


vai 


Arg 


Giy 


Asp 


Leu 


Leu 


Asp 


lie 


llC u. 




610 










615 










C *5 A 










lie 


Tnr 


Ser 


Ser 


Asp 


Gin 


Pne 


Pro 


G±U 


Leu 


Aia 


LjIU 


Aia 


Arg 




J-VXCt 












an 










a c 

D J J 










O *i U 


Val 


Leu 


Val 


He 


Arg 


Glu 


Lys 


Leu 


Asp 


Ser 


Ser 


He 


Ala 


Ser 


rile 


Arg 










645 










CCA 










c, c 




Lys 


Lys 


Leu 


Ala 


He 


Arg 


Asn 


Leu 


Glu 


Phe 


Leu 


Gin 


T/- 1 

Val 


A v< 

Ser 


uiy 


T 1 0 
11c 








£ C A 

660 










c c c 
bob 










b / u 






Thr 


His 


Leu 


He 


Glu 


Leu 


Pro 


val 


Asp 


Ser 


T -r fA 

Lys 


Val 


Pro 


nlS 


Asn 


lrp 






O / b 




















bob 








IT- T 

val 


Lys 


vai 


Asn 


Ser 


inr 


Lys 


Lys 


mr 


lie 


Arg 


Tyr 


T.J ■! ri 
HIS 


Pro 


Pro 


ulU 




c a a 










6 yb 










inn 










Tin 

lie 


vai 


Ala 


Gly 


Leu 


Asp 


GiU 


Leu 


Aia 


T ni 1 

Leu 


Aia 


inr 


GIU 


U -i n 

illS 


lieu 


Hid 


i a c 

70b 










1 1 A 

/ 10 










HI P 

/lb 










1 ~i f\ 


lie 


val 


Asn 


Arg 


71 T _ 

Ala 


Ser 


Trp 


7\ _ „ 

Asp 


Ser 


Phe 


Leu 


Lys 


Ser 


pne 


Gov* 
□ CI 


arg 










725 










73 0 










735 




Tyr 


Tyr 


Thr 


Asp 


Phe 


Lys 


Ala 


Ala 


Val 


Gin 


Ala 


Leu 


Ala 


Ala 


Leu 


Asp 








740 










745 










1 C A 

750 






Cys 


Leu 


His 


Ser 


Leu 


Ser 


Thr 


Leu 


Ser 


Arg 


Asn 


T - .M 

Lys 


Asn 


Tyr 


Val 


Arg 






n r p 

7bb 










/oO 










n c c 
/Ob 








Pro 


Glu 


Phe 


Val 


Asp 


Asp 


Cys 


Glu 


Pro 


Val 


Glu 


He 


Asn 


He 


Gin 


Ser 




770 










775 










»-i j-i 

780 










Gly 


Arg 


His 


Pro 


Val 


Leu 


Glu 


Thr 


He 


Leu 


Gin 


Asp 


Asn 


Phe 


Val 


Pro 


HOC 

/ Ob 










/ y u 










/y b 










800 


Asn 


Asp 


Tnr 


He 


Leu 


His 


Ala 


Glu 


Gly 


Glu 


Tyr 


Cys 


Gin 


He 


He 


Thr 










one 

805 










810 










815 




Gly 


Pro 


Asn 


Met 


Gly 


Gly 


Lys 


Ser 


Cys 


Tyr 


He 


Arg 


Gin 


vai 


Ala 


Leu 








0 n A 

82 0 










OTP 

82b 










O 1 A 

83 0 






lie 


Ser 


He 


Kit — . 4- 

Met 


TV T — 

Ala 


Gin 


Val 


Gly 


Ser 


Phe 


Val 


Pro 


Ala 


Ser 


Phe 


Ala 






D "3 C 










O A A 










84 5 








Lys 


Leu 


His 


Val 


Leu 


Asp 


Gly 


Val 


Phe 


Thr 


Arg 


Met 


Gly 


Ala 


Ser Asp 




850 










855 










860 










Ser 


He 


Gin 


His 


Gly 


Arg 


Ser 


Thr 


Phe 


Leu 


Glu 


Glu 


Leu 


Ser 


Glu 


Ala 


865 










870 










875 










880 


Ser 


TT- _ 

His 


He 


He 


Arg 


Thr 


Cys 


Ser 


Ser 


Arg 


Ser 


Leu 


Val 


He 


Leu 


Asp 










885 










h n A 

890 










895 




Glu 


Leu 


Gly 


Arg 


Gly 


Thr 


Ser 


Thr 


His 


Asp 


Gly 


Val 


Ala 


He 


Ala 


Tyr 








you 










O A C 

y ub 










AIA 
910 






Ala 


Thr 


Leu 


Gin 


His 


Leu 


Leu 


Ala 


Glu 


Lys 


Arg 


Cys 


Leu 


Val 


Leu 


Phe 






915 










920 










925 








Val 


Thr 


His 


Tyr 


Pro 


Glu 


He 


Ala 


Glu 


He 


Ser 


Asn 


Gly 


Phe 


Pro Gly 




930 










935 










940 










Ser 


Val 


Gly 


Thr 


Tyr 


His 


Val 


Ser 


Tyr 


Leu 


Thr 


Leu 


Gin 


Lys 


Asp 


Lys 


945 










950 










955 










960 


Gly 


Ser 


Tyr 


Asp 


His 


Asp 


Asp 


Val 


Thr 


Tyr 


Leu 


Tyr 


Lys 


Leu 


Val 


Arg 



965 970 975 

Gly Leu Cys Ser Arg Ser Phe Gly Phe Lys Val Ala Gin Leu Ala Gin 

980 985 990 

lie Pro Pro Ser Cys lie Arg Arg Ala He Ser Met Ala Ala Lys Leu 

995 1000 1005 

Glu Ala Glu Val Arg Ala Arg Glu Arg Asn Thr Arg Met Gly Glu Pro 

1010 1015 1020 

Glu Gly His Glu Glu Pro Arg Gly Ala Glu Glu Ser He Ser Ala Leu 
1025 1030 1035 1040 

Gly Asp Leu Phe Ala Asp Leu Lys Phe Ala Leu Ser Glu Glu Asp Pro 

1045 1050 1055 

Trp Lys Ala Phe Glu Phe Leu Lys His Ala Trp Lys lie Ala Gly Lys 

1060 1065 1070 

He Arg Leu Lys Pro Thr Cys Ser Phe 
1075 1080 



<210> 20 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MSH6 specific primer 63 8 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 

<400> 20 

tctctaccag gtgacgaaaa accg 

<210> 21 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer S81 for PCR using cDNA of Arabidopsis 
thaliana ecotype Columbia. 

<400> 21 

cgtcgccttt agcatcccct tccttcac 

<210> 22 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MSH6 specific primer S823 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 

<400> 22 

gcttggcgca tctaatagaa tcatgacagg 

<210> 23 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> MSH6 specific primer 637 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 



<400> 23 

gacagcgtca gttcttcaga atgc 24 

<210> 24 
<211> 33 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> MSH6 specific primer IS 8 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 



<400> 24 

atcccgggat gcagcgccag agatcgattt tgt 33 

<210> 25 

<211> 27 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> MSH6 specific primer S83 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 

<400> 25 

cgctatctat ggctgcttcg aattgag 27 

<210> 26 
<211> 2188 
<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 



<220> 

<221> misc_feature 
<222> (0) . . . (0) 
<223> Clone 43 



<400> 26 

cccgggatgc agcgccagag atcgattttg tctttcttcc aaaaacccac ggcggcgact 60 
acgaagggtt tggtttccgg cgatgctgct agcggcgggg gcggcagcgg aggaccacga 12 0 
tttaatgtga aggaagggga tgctaaaggc gacgcttctg tacgttttgc tgtttcgaaa 180 
tctgtcgatg aggttagagg aacggatact ccaccggaga aggttccgcg tcgtgtcctg 240 
ccgtctggat ttaagccggc tgaatccgcc ggtgatgctt cgtccctgtt ctccaatatt 3 00 
atgcataagt ttgtaaaagt cgatgatcga gattgttctg gagagaggag ccgagaagat 3 60 
gttgttccgc tgaatgattc atctctatgt atgaaggcta atgatgttat tcctcaattt 420 
cgttccaata atggtaaaac tcaagaaaga aaccatgctt ttagtttcag tgggagagct 480 
gaacttagat cagtagaaga tataggagta gatggcgatg ttcctggtcc agaaacacca 54 0 
gggatgcgtc cacgtgcttc tcgcttgaag cgagttctgg aggatgaaat gacttttaag 600 
gaggataagg ttcctgtatt ggactctaac aaaaggctga aaatgctcca ggatccggtt 660 
tgtggagaga agaaagaagt aaacgaagga accaaatttg aatggcttga gtcttctcga 72 0 
atcagggatg ccaatagaag acgtcctgat gatccccttt acgatagaaa gaccttacac 780 
ataccacctg atgttttcaa gaaaatgtct gcatcacaaa agcaatattg gagtgttaag 840 



agtgaatata tggacattgt gcttttcttt aaagtgggga aattttatga gctgtatgag 900 
ctagatgcgg aattaggtca caaggagctt gactggaaga tgaccatgag tggtgtggga 960 
aaatgcagac aggttggtat ctctgaaagt gggatagatg aggcagtgca aaagctatta 102 0 
gctcgtggat ataaagttgg acgaatcgag cagctagaaa catctgacca agcaaaagcc 1080 
agaggtgcta atactataat tccaaggaag ctagttcagg tattaactcc atcaacagca 1140 
agcgagggaa acatcgggcc tgatgccgtc catcttcttg ctataaaaga gatcaaaatg 1200 
gagctacaaa agtgttcaac tgtgtatgga tttgcttttg ttgactgtgc tgccttgagg 1260 
ttttgggttg ggtccatcag cgatgatgca tcatgtgctg ctcttggagc gttattgatg 1320 
caggtttctc caaaggaagt gttatatgac agtaaagggc tatcaagaga agcacaaaag 1380 
gctctaagga aatatacgtt gacagggtct acggcggtac agttggctcc agtaccacaa 1440 
gtaatggggg atacagatgc tgctggagtt agaaatataa tagaatctaa cggatacttt 1500 
aaaggttctt ctgaatcatg gaactgtgct gttgatggtc taaatgaatg tgatgttgcc 1560 
cttagtgctc ttggagagct aattaatcat ctgtctaggc taaagctaga agatgtactt 1620 
aagcatgggg atatttttcc ataccaagtt tacaggggtt gtctcagaat tgatggccag 1680 
acgatggtaa atcttgagat atttaacaat agctgtgatg gtggtccttc agggaccttg 1740 
tacaaatatc ttgataactg tgttagtcca actggtaagc gactcttaag gaattggatc 1800 
tgccatccac tcaaagatgt agaaagcatc aataaacggc ttgatgtagt tgaagaattc 1860 
acggcaaact cagaaagtat gcaaatcact ggccagtatc tccacaaact tccagactta 1920 
gaaagactgc tcggacgcat caagtctagc gttcgatcat cagcctctgt gttgcctgct 1980 
cttctgggga aaaaagtgct gaaacaacga gttaaagcat ttgggcaaat tgtgaaaggg 204 0 
ttcagaagtg gaattgatct gttgttggct ctacagaagg aatcaaatat gatgagtttg 2100 
ctttataaac tctgtaaact tcctatatta gtaggaaaaa gcgggctaga gttatttctt 2160 
tctcaattcg aagcagccat agatagcg 2188 

<210> 27 
<211> 1385 
<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 
<220> 

<221> misc_feature 
<222> (0) . . . (0) 
<223> Clone 62 

<400> 27 

catcagcctc tgtgttgcct gctcttctgg ggaaaaaagt gctgaaacaa cgagttaaag 60 
catttgggca aattgtgaaa gggttcagaa gtggaattga tctgttgttg gctctacaga 120 
aggaatcaaa tatgatgagt ttgctttata aactctgtaa acttcctata ttagtaggaa 180 
aaagcgggct agagttattt ctttctcaat tcgaagcagc catagatagc gactttccaa 240 
attatcagaa ccaagatgtg acagatgaaa acgctgaaac tctcacaata cttatcgaac 3 00 
tttttatcga aagagcaact caatggtctg aggtcattca caccataagc tgcctagatg 360 
tcctgagatc ttttgcaatc gcagcaagtc tctctgctgg aagcatggcc aggcctgtta 420 
tttttcccga atcagaagct acagatcaga atcagaaaac aaaagggcca atacttaaaa 4 80 
tccaaggact atggcatcca tttgcagttg cagccgatgg tcaattgcct gttccgaatg 540 
atatactcct tggcgaggct agaagaagca gtggcagcat tcatcctcgg tcattgttac 600 
tgacgggacc aaacatgggc ggaaaatcaa ctcttcttcg tgcaacatgt ctggccgtta 660 
tctttgccca acttggctgc tacgtgccgt gtgagtcttg cgaaatctcc ctcgtggata 720 
ctatcttcac aaggcttggc gcatctgata gaatcatgac aggagagagt acctttttgg 780 
tagaatgcac tgagacagcg tcagttcttc agaatgcaac tcaggattca ctagtaatcc 840 
ttgacgaact gggcagagga actagtactt tcgatggata cgccattgca tactcggttt 900 
ttcgtcacct ggtagagaaa gttcaatgtc ggatgctctt tgcaacacat taccaccctc 960 
tcaccaagga attcgcgtct cacccacgtg tcacctcgaa acacatggct tgcgcattca 1020 
aatcaagatc tgattatcaa ccacgtggtt gtgatcaaga cctagtgttc ttgtaccgtt 1080 
taaccgaggg agcttgtcct gagagctacg gacttcaagt ggcactcatg gctggaatac 1140 
caaaccaagt ggttgaaaca gcatcaggtg ctgctcaagc catgaagaga tcaattgggg 1200 
aaaacttcaa gtcaagtgag ctaagatctg agttctcaag tctgcatgaa gactggctca 1260 
agtcattggt gggtatttct cgagtcgccc acaacaatgc ccccattggc gaagatgact 13 2 0 



acgacacttt gttttgctta tggcatgaga tcaaatcctc ttactgtgtt cccaaataac 13 80 



ccggg 13 85 

<210> 28 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MSH6 specific primer 2S8 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 

<400> 28 

atcccgggtt atttgggaac acagtaagag gatt 34 

<210> 29 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MSH6 specific primer S82 for PCR using cDNA of 
Arabidopsis thaliana ecotype Columbia. 

<400> 29 

gcgttcgatc atcagcctct gtgttgc 27 

<210> 30 
<211> 3606 
<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 

<220> 
<221> CDS 

<222> (142) . . . (3468) 

<223> AtMSH6 full-length cDNA and deduced sequence of 
the encoded polypeptide. 

<400> 30 

aaaagttgag ccctgaggag tatcgtttcc gccatttcta cgacgcaagg cgaaaatttt 60 
tggcgccaat ctttcccccc tttcgaattc tctcagctca aaacatcgtt tctctctcac 120 
tctctctcac aattccaaaa a atg cag cgc cag aga teg att ttg tct ttc 171 

Met Gin Arg Gin Arg Ser lie Leu Ser Phe 
15 10 

ttc caa aaa ccc acc gcg gcg act acg aag ggt ttg gtt tec ggc gat 219 
Phe Gin Lys Pro Thr Ala Ala Thr Thr Lys Gly Leu Val Ser Gly Asp 
15 20 25 

get get age ggc ggg ggc ggc age gga gga cca cga ttt aat gtg aag 267 
Ala Ala Ser Gly Gly Gly Gly Ser Gly Gly Pro Arg Phe Asn Val Arg 
30 35 40 



gaa ggg gat get aaa ggc gac get tct gta cgt ttt get gtt teg aaa 
Glu Gly Asp Ala Lys Gly Asp Ala Ser Val Arg Phe Ala Val Ser Lys 
45 50 55 



315 



tct gtc gat gag gtt aga gga acg gat act cca ccg gag aag gtt ccg 363 
Ser Val Asp Glu Val Arg Gly Thr Asp Thr Pro Pro Glu Lys Val Pro 
60 65 70 

cgt cgt gtc ctg ccg tct gga ttt aag ccg get gaa tec gec gst gat 411 
Arg Arg Val Leu Pro Ser Gly Phe Lys Pro Ala Glu Ser Ala Gly Asp 
75 80 85 90 

get teg tec ctg ttc tec aat att atg cat aag ttt gta aaa gtc gat 459 
Ala Ser Ser Leu Phe Ser Asn lie Met His Lys Phe Val Lys Val Asp 
95 100 105 

gat cga gat tgt tct gga gag agg age cga gaa gat gtt gtt ccg ctg 507 
Asp Arg Asp Cys Ser Gly Glu Arg Ser Arg Glu Asp Val Val Pro Leu 
110 115 120 

aat gat tea tct eta tgt atg aag get aat gat gtt att cct caa ttt 555 
Asn Asp Ser Ser Leu Cys Met Lys Ala Asn Asp Val lie Pro Gin Phe 
125 130 135 

cgt tec aat aat ggt aaa act caa gaa aga aac cat get ttt agt ttc 603 
Arg Ser Asn Asn Gly Lys Thr Gin Glu Arg Asn His Ala Phe Ser Phe 
140 145 150 

a 9t 999 a 9^ get gaa ctt aga tea gta gaa gat ata gga gta gat ggc 651 
Ser Gly Arg Ala Glu Leu Arg Ser Val Glu Asp lie Gly Val Asp Gly 
155 160 165 170 

gat gtt cct ggt cca gaa aca cca ggg atg cgt cca cgt get tct cgc 699 
Asp Val Pro Gly Pro Glu Thr Pro Gly Met Arg Pro Arg Ala Ser Arg 
175 180 185 

ttg aag cga gtt ctg gag gat gaa atg act ttt aag gag gat aag gtt 747 
Leu Lys Arg Val Leu Glu Asp Glu Met Thr Phe Lys Glu Asp Lys Val 
190 195 200 

cct gta ttg gac tct aac aaa agg ctg aaa atg etc cag gat ccg gtt 795 
Pro Val Leu Asp Ser Asn Lys Arg Leu Lys Met Leu Gin Asp Pro Val 
205 210 215 

tgt gga gag aag aaa gaa gta aac gaa gga ace aaa ttt gaa tgg ctt 843 
Cys Gly Glu Lys Lys Glu Val Asn Glu Gly Thr Lys Phe Glu Trp Leu 
220 225 230 

gag tct tct cga ate agg gat gee aat aga aga cgt cct gat gat ccc 891 
Glu Ser Ser Arg He Arg Asp Ala Asn Arg Arg Arg Pro Asp Asp Pro 
235 240 245 250 

ctt tac gat aga aag acc tta cac ata cca cct gat gtt ttc aag aaa 939 
Leu Tyr Asp Arg Lys Thr Leu His He Pro Pro Asp Val Phe Lys Lys 
255 260 265 

atg tct gca tea caa aag caa tat tgg agt gtt aag agt gaa tat atg 987 
Met Ser Ala Ser Gin Lys Gin Tyr Trp Ser Val Lys Ser Glu Tyr Met 
270 275 280 

gac att gtg ctt ttc ttt aaa gtg ggg aaa ttt tat gag ctg tat gag 1035 



Asp He Val 
285 

eta gat gcg 
Leu Asp Ala 
300 

agt ggt gtg 
Ser Gly Val 
315 

gat gag gca 
Asp Glu Ala 



ate gag cag 
He Glu Gin 



act ata att 
Thr He He 
365 

age gag gga 
Ser Glu Gly 
380 

gag ate aaa 
Glu He Lys 
3 95 

ttt gtt gac 
Phe Val Asp 



gat gca tea 
Asp Ala Ser 



aag gaa gtg 
Lys Glu Val 
445 

get eta agg 
Ala Leu Arg 
460 

cca gta cca 
Pro Val Pro 
475 

ata ata gaa 
He He Glu 



tgt get gtt 
Cys Ala Val 



Leu Phe Phe 



gaa tta ggt 
Glu Leu Gly 



gga aaa tgc 
Gly Lys Cys 
320 

gtg caa aag 
Val Gin Lys 
335 

eta gaa aca 
Leu Glu Thr 
350 

cca agg aag 
Pro Arg Lys 



aac ate ggg 
Asn He Gly 



atg gag eta 
Met Glu Leu 
400 

tgt get gec 
Cys Ala Ala 
415 

tgt get get 
Cys Ala Ala 
430 

tta tat gac 
Leu Tyr Asp 



aaa tat acg 
Lys Tyr Thr 



caa gta atg 
Gin Val Met 
480 

tct aac gga 
Ser Asn Gly 
495 

gat ggt eta 
Asp Gly Leu 



Lys Val Gly 
290 

cac aag gag 
His Lys Glu 
305 

aga cag gtt 
Arg Gin Val 



eta tta get 
Leu Leu Ala 



tct gac caa 
Ser Asp Gin 
355 

eta gtt cag 
Leu Val Gin 
370 

cct gat gee 
Pro Asp Ala 
385 

caa aag tgt 
Gin Lys Cys 



ttg agg ttt 
Leu Arg Phe 



ctt gga gcg 
Leu Gly Ala 
435 

agt aaa ggg 
Ser Lys Gly 
450 

ttg aca ggg 
Leu Thr Gly 
465 

ggg gat aca 
Gly Asp Thr 



tac ttt aaa 
Tyr Phe Lys 



aat gaa tgt 
Asn Glu Cys 



Lys Phe Tyr 



ctt gac tgg 
Leu Asp Trp 
310 

ggt ate tct 
Gly He Ser 
325 

cgt gga tat 
Arg Gly Tyr 
340 

gca aaa gec 
Ala Lys Ala 



gta tta act 
Val Leu Thr 



gtc cat ctt 
Val His Leu 
390 

tea act gtg 
Ser Thr Val 
405 

tgg gtt ggg 
Trp Val Gly 
420 

tta ttg atg 
Leu Leu Met 



eta tea aga 
Leu Ser Arg 



tct acg gcg 
Ser Thr Ala 
470 

gat get get 
Asp Ala Ala 
485 

ggt tct tct 
Gly Ser Ser 
500 

gat gtt gee 
Asp Val Ala 



Glu Leu Tyr 
295 

aag atg acc 
Lys Met Thr 



gaa agt ggg 
Glu Ser Gly 



aaa gtt gga 
Lys Val Gly 
345 

aga ggt get 
Arg Gly Ala 
360 

cca tea aca 
Pro Ser Thr 
375 

ctt get ata 
Leu Ala He 



tat gga ttt 
Tyr Gly Phe 



tec ate age 
Ser He Ser 
425 

cag gtt tct 
Gin Val Ser 
440 

gaa gca caa 
Glu Ala Gin 
455 

gta cag ttg 
Val Gin Leu 



gga gtt aga 
Gly Val Arg 



gaa tea tgg 
Glu Ser Trp 
505 

ctt agt get 
Leu Ser Ala 



Glu 



atg 1083 
Met 



ata 1131 

He 

330 

cga 1179 
Arg 



aat 1227 
Asn 



gca 1275 
Ala 



aaa 1323 
Lys 



get 1371 

Ala 

410 

gat 1419 
Asp 



cca 1467 
Pro 



aag 1515 
Lys 



get 1563 
Ala 



aat 1611 

Asn 

490 

aac 1659 
Asn 



ctt 1707 
Leu 



510 515 520 

gga gag eta att aat cat ctg tct agg eta aag eta gaa gat gta ctt 1755 
Gly Glu Leu lie Asn His Leu Ser Arg Leu Lys Leu Glu Asp Val Leu 
525 530 535 

aag cat ggg gat att ttt cca tac caa gtt tac agg ggt tgt etc aga 1803 
Lys His Gly Asp He Phe Pro Tyr Gin Val Tyr Arg Gly Cys Leu Arg 
540 545 550 

att gat ggc cag acg atg gta aat ctt gag ata ttt aac aat age tgt 1851 
He Asp Gly Gin Thr Met Val Asn Leu Glu He Phe Asn Asn Ser Cys 
555 560 565 570 

gat ggt ggt cct tea ggg ace ttg tac aaa tat ctt gat aac tgt gtt 1899 
Asp Gly Gly Pro Ser Gly Thr Leu Tyr Lys Tyr Leu Asp Asn Cys Val 
575 580 585 

agt cca act ggt aag cga etc tta agg aat tgg ate tgc cat cca etc 1947 
Ser Pro Thr Gly Lys Arg Leu Leu Arg Asn Trp He Cys His Pro Leu 
590 595 600 

aaa gat gta gaa age ate aat aaa egg ctt gat gta gtt gaa gaa ttc 1995 
Lys Asp Val Glu Ser He Asn Lys Arg Leu Asp Val Val Glu Glu Phe 
605 610 615 

acg gca aac tea gaa agt atg caa ate act ggc cag tat etc cac aaa 2043 
Thr Ala Asn Ser Glu Ser Met Gin He Thr Gly Gin Tyr Leu His Lys 
620 625 630 

ctt cca gac tta gaa aga ctg etc gga cgc ate aag tct age gtt cga 2091 
Leu Pro Asp Leu Glu Arg Leu Leu Gly Arg He Lys Ser Ser Val Arg 
635 640 645 650 

tea tea gec tct gtg ttg cct get ctt ctg ggg aaa aaa gtg ctg aaa 213 9 
Ser Ser Ala Ser Val Leu Pro Ala Leu Leu Gly Lys Lys Val Leu Lys 
655 660 665 

caa cga gtt aaa gca ttt ggg caa att gtg aaa ggg ttc aga agt gga 2187 
Gin Arg Val Lys Ala Phe Gly Gin He Val Lys Gly Phe Arg Ser Gly 
670 675 680 

att gat ctg ttg ttg get eta cag aag gaa tea aat atg atg agt ttg 2235 
He Asp Leu Leu Leu Ala Leu Gin Lys Glu Ser Asn Met Met Ser Leu 
685 690 695 

ctt tat aaa etc tgt aaa ctt cct ata tta gta gga aaa age ggg eta 2283 
Leu Tyr Lys Leu Cys Lys Leu Pro He Leu Val Gly Lys Ser Gly Leu 
700 705 710 

gag tta ttt ctt tct caa ttc gaa gca gec ata gat age gac ttt cca 2331 
Glu Leu Phe Leu Ser Gin Phe Glu Ala Ala He Asp Ser Asp Phe Pro 
715 720 725 730 

aat tat cag aac caa gat gtg aca gat gaa aac get gaa act etc aca 2379 
Asn Tyr Gin Asn Gin Asp Val Thr Asp Glu Asn Ala Glu Thr Leu Thr 
735 740 745 



ata ctt ate gaa ctt ttt ate gaa aga gca act caa tgg tct gag gtc 2427 
lie Leu He Glu Leu Phe He Glu Arg Ala Thr Gin Trp Ser Glu Val 
750 755 760 

att cac acc ata age tgc eta gat gtc ctg aga tct ttt gca ate gca 2475 
He His Thr He Ser Cys Leu Asp Val Leu Arg Ser Phe Ala He Ala 
765 770 775 

gca agt etc tct get gga age atg gee agg cct gtt att ttt ccc gaa 2523 
Ala Ser Leu Ser Ala Gly Ser Met Ala Arg Pro Val He Phe Pro Glu 
780 785 790 

tea gaa get aca gat cag aat cag aaa aca aaa ggg cca ata ctt aaa 2571 
Ser Glu Ala Thr Asp Gin Asn Gin Lys Thr Lys Gly Pro lie Leu Lys 
795 800 805 810 

ate caa gga eta tgg cat cca ttt gca gtt gca gec gat ggt caa ttg 2619 
He Gin Gly Leu Trp His Pro Phe Ala Val Ala Ala Asp Gly Gin Leu 
815 820 825 

cct gtt ccg aat gat ata etc ctt ggc gag get aga aga age agt ggc 2667 
Pro Val Pro Asn Asp He Leu Leu Gly Glu Ala Arg Arg Ser Ser Gly 
830 835 840 

age att cat cct egg tea ttg tta ctg acg gga cca aac atg ggc gga 2715 
Ser He His Pro Arg Ser Leu Leu Leu Thr Gly Pro Asn Met Gly Gly 
845 850 855 

aaa tea act ctt ctt cgt gca aca tgt ctg gec gtt ate ttt gec caa 2763 
Lys Ser Thr Leu Leu Arg Ala Thr Cys Leu Ala Val He Phe Ala Gin 
860 865 870 

ctt ggc tgc tac gtg ccg tgt gag tct tgc gaa ate tec etc gtg gat 2811 
Leu Gly Cys Tyr Val Pro Cys Glu Ser Cys Glu He Ser Leu Val Asp 
875 880 885 890 

act ate ttc aca agg ctt ggc gca tct gat aga ate atg aca gga gag 2859 
Thr He Phe Thr Arg Leu Gly Ala Ser Asp Arg He Met Thr Gly Glu 
895 900 905 

agt acc ttt ttg gta gaa tgc act gag aca gcg tea gtt ctt cag aat 2907 
Ser Thr Phe Leu Val Glu Cys Thr Glu Thr Ala Ser Val Leu Gin Asn 
910 915 920 

gca act cag gat tea eta gta ate ctt gac gaa ctg ggc aga gga act 2955 
Ala Thr Gin Asp Ser Leu Val He Leu Asp Glu Leu Gly Arg Gly Thr 
925 930 935 

agt act ttc gat gga tac gec att gca tac teg gtt ttt cgt cac ctg 3003 
Ser Thr Phe Asp Gly Tyr Ala He Ala Tyr Ser Val Phe Arg His Leu 
940 945 950 

gta gag aaa gtt caa tgt egg atg etc ttt gca aca cat tac cac cct 3051 
Val Glu Lys Val Gin Cys Arg Met Leu Phe Ala Thr His Tyr His Pro 
955 960 965 970 



« 



etc acc aag gaa ttc gcg tct cac cca cgt gtc acc teg aaa cac atg 3099 
Leu Thr Lys Glu Phe Ala Ser His Pro Arg Val Thr Ser Lys His Met 
975 980 985 

get tgc gca ttc aaa tea aga tct gat tat caa cca cgt ggt tgt gat 3147 
Ala Cys Ala Phe Lys Ser Arg Ser Asp Tyr Gin Pro Arg Gly Cys Asp 
990 995 1000 

caa gac eta gtg ttc ttg tac cgt tta acc gag gga get tgt cct gag 3195 
Gin Asp Leu Val Phe Leu Tyr Arg Leu Thr Glu Gly Ala Cys Pro Glu 
1005 1010 1015 

age tac gga ctt caa gtg gca etc atg get gga ata cca aac caa gtg 324 3 
Ser Tyr Gly Leu Gin Val Ala Leu Met Ala Gly lie Pro Asn Gin Val 
1020 1025 1030 

gtt gaa aca gca tea ggt get get caa gee atg aag aga tea att ggg 3291 
Val Glu Thr Ala Ser Gly Ala Ala Gin Ala Met Lys Arg Ser lie Gly 
1035 1040 1045 1050 

gga aac ttc aag tea agt gag eta aga tct gag ttc tea agt ctg cat 3339 
Glu Asn Phe Lys Ser Ser Glu Leu Arg Ser Glu Phe Ser Ser Leu His 
1055 1060 1065 

gaa gac tgg etc aag tea ttg gtg ggt att tct cga gtc gec cac aac 3 3 87 
Glu Asp Trp Leu Lys Ser Leu Val Gly lie Ser Arg Val Ala His Asn 
1070 1075 1080 

aat gee ccc att ggc gaa gat gac tac gac act ttg ttt tgc tta tgg 3435 
Asn Ala Pro lie Gly Glu Asp Asp Tyr Asp Thr Leu Phe Cys Leu Trp 
1085 1090 1095 

cat gag ate aaa tec tct tac tgt gtt ccc aaa taaatggcta tgacataaca 3488 
His Glu lie Lys Ser Ser Tyr Cys Val Pro Lys 
1100 1105 

ctatctgaag ctcgttaagt cttttgcctc tctgatgttt attcctctta aaaaatgett 3548 
atatatcaaa aaattgtttc ctcgattaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 3606 

<210> 31 
<211> 1109 
<212> PRT 

<213> Arabidopsis thaliana ecotype Columbia 
<220> 

<223> Polypeptide MSH6 
<400> 31 

Met Gin Arg Gin Arg Ser He Leu Ser Phe Phe Gin Lys Pro Thr Ala 

15 10 15 

Ala Thr Thr Lys Gly Leu Val Ser Gly Asp Ala Ala Ser Gly Gly Gly 

20 25 30 

Gly Ser Gly Gly Pro Arg Phe Asn Val Arg Glu Gly Asp Ala Lys Gly 

35 40 45 

Asp Ala Ser Val Arg Phe Ala Val Ser Lys Ser Val Asp Glu Val Arg 

50 55 60 

Gly Thr Asp Thr Pro Pro Glu Lys Val Pro Arg Arg Val Leu Pro Ser 



65 






70 








75 








80 


m v 


Phe 


Lys 


Pro Ala Glu 


Ser Ala Gly Asp Ala 


Ser 


Ser 


Leu 


Phe Ser 








85 






90 










95 


7i an 


He 


Met 


His Lys Phe 


Val Lvs 


Val 


Asp Asp 


Arq 


Asp 


Cys 


Ser Gly 








100 




105 










110 




ox u 


Arg Ser Arg Glu Asp 


Val Val 


Pro 


Leu 


Asn 


Asp 


Ser 


Ser 


Leu Cys 






115 




120 










125 








Lys 


Ala 


Asn Asp Val 


He Pro 


Gin 


Phe 


Arg 


Ser 


Asn 


Asn 


Gly Lys 




130 






135 








14 0 








Thr 


Gin 


Glu 


Arg Asn His 


Ala Phe 


Ser 


Phe 


Ser 


Gly 


Arg 


Ala 


Glu Leu 


145 






150 








155 








160 




Ser 


Val 


Glu Asp He 


Gly Val 


Asp Gly Asp 


Val 


Pro 


Gly 


Pro Glu 








165 






170 










175 


Thr 


Pro Gly Met Arg Pro 


Arg Ala 


Ser Arg 


Leu 


Lys 


Arg 


Val 


Leu Glu 








180 




185 










190 






Glu 


Met 


Thr Phe Lys 


Glu Asp 


Lys 


Val 


Pro 


Val 


Leu 


Asp 


Ser Asn 






195 




200 










205 








Arg 


Leu 


Lys Met Leu 


Gin Asp 


Pro 


Val 


Cys 


Gly 


Glu 


Lys 


Lys Glu 




210 






215 








220 








Val 


Asn Glu Gly Thr Lys 


Phe Glu Trp 


Leu 


Glu 


Ser 


Ser 


Arg 


He Arg 


225 






230 








235 








240 


Asp 


Ala Asn Arg Arg Arg 


Pro Asp 


Asp 


Pro 


Leu 


Tyr 


Asp 


Arg 


Lys Thr 








245 






250 










255 


Leu 


His 


He 


Pro Pro Asp 


Val Phe 


Lys 


Lys 


Met 


Ser 


Ala 


Ser 


Gin Lys 








260 




265 










270 




Gin 


Tyr Trp 


Ser Val Lys 


Ser Glu 


Tyr 


Met 


Asp 


He 


Val 


Leu 


Phe Phe 






275 




280 










285 






Lys 


Val 


Gly 


Lys Phe Tyr 


Glu Leu 


Tyr 


Glu 


Leu 


Asp 


Ala 


Glu 


Leu Gly 




290 






295 








300 








His 


Lys 


Glu 


Leu Asp Trp 


Lys Met 


Thr 


Met 


Ser 


Glv 


Val 


Glv 


Lys Cys 


305 






310 








315 








320 




Gin 


Val 


Gly He Ser 


Glu Ser 


Gly 


He 


Asp 


Glu 


Ala 


Val 


Gin Lys 








325 






330 










335 


Leu 


Leu 


Ala 


Arg Gly Tyr 


Lys Val 


Gly Arg 


He 


Glu 


Gin 


Leu 


Glu Thr 








340 




345 










350 




Ser 


Asp 


Gin 


Ala Lys Ala 


Arg Gly Ala Asn Thr 


He 


He 


Pro 


Arg Lys 






355 




360 










365 






Leu 


Val 


Gin 


Val Leu Thr 


Pro Ser 


Thr 


Ala 


Ser 


Glu 


Glv 


Asn 


He Gly 




370 






375 








380 








Pro 


Asp 


Ala 


Val His Leu 


Leu Ala 


He 


Lys 


Glu 


He 


Lvs 


Met 


Glu Leu 


385 






390 








395 








400 


Gin 


Lys 


Cys 


Ser Thr Val 


Tyr Gly Phe 


Ala 


Phe 


Val 


Asp 


Cvs 


Ala Ala 








405 






410 










415 


Leu 


Arg 


Phe 


Trp Val Gly 


Ser He 


Ser Asp Asp 


Ala 


Ser 


Cvs 


Ala Ala 








420 




425 










430 




Leu 


Gly Ala 


Leu Leu Met 


Gin Val 


Ser 


Pro 


Lys 


Glu 


Val 


Leu 


Tyr Asp 






435 




440 










445 






Ser 


Lys Gly Leu Ser Arg 


Glu Ala 


Gin 


Lys 


Ala 


Leu 


Arg 


Lvs 


Tyr Thr 




450 






455 








460 








Jjc U. 


Thr Gly Ser Thr Ala 


Val Gin 


Leu 


Ala 


Pro 


Val 




m n 

VjJ.Il 


V ai. 1'IC V- 


465 






470 








475 








480 


Gly 


Asp 


Thr Asp Ala Ala 


Gly Val Arg Asn 


He 


He 


Glu 


Ser 


Asn Gly 








485 






490 










495 


Tyr 


Phe 


Lys 


Gly Ser Ser 


Glu Ser 


Trp 


Asn 


Cys 


Ala 


Val 


Asp 


Gly Leu 








500 




505 










510 




Asn 


Glu 


Cys 


Asp Val Ala 


Leu Ser 


Ala 


Leu Gly 


Glu 


Leu 


lie 


Asn His 






515 




520 










525 







Leu Ser Arg Leu 
530 

Pro Tyr Gin Val 
545 

Val Asn Leu Glu 

Thr Leu Tyr Lys 
580 

Leu Leu Arg Asn 
595 

Asn Lys Arg Leu 
610 

Met Gin lie Thr 
625 

Leu Leu Gly Arg 

Pro Ala Leu Leu 
660 

Gly Gin lie Val 
675 

Leu Gin Lys Glu 
690 

Leu Pro He Leu 
705 

Phe Glu Ala Ala 

Val Thr Asp Glu 
740 

He Glu Arg Ala 
755 

Leu Asp Val Leu 
770 

Ser Met Ala Arg 
785 

Asn Gin Lys Thr 

Pro Phe Ala Val 
820 

Leu Leu Gly Glu 
835 

Leu Leu Leu Thr 
850 

Ala Thr Cys Leu 
865 

Cys Glu Ser Cys 

Gly Ala Ser Asp 
900 

Cys Thr Glu Thr 
915 

Val He Leu Asp 
930 

Ala He Ala Tyr 
945 

Arg Met Leu Phe 
Ser His Pro Arg 



Lys Leu Glu Asp 
535 

Tyr Arg Gly Cys 
550 

He Phe Asn Asn 
565 

Tyr Leu Asp Asn 

Trp He Cys His 
600 

Asp Val Val Glu 
615 

Gly Gin Tyr Leu 
630 

lie Lys Ser Ser 
645 

Gly Lys Lys Val 

Lys Gly Phe Arg 
680 

Ser Asn Met Met 
695 

Val Gly Lys Ser 
710 

He Asp Ser Asp 
725 

Asn Ala Glu Thr 

Thr Gin Trp Ser 
760 

Arg Ser Phe Ala 
775 

Pro Val He Phe 
790 

Lys Gly Pro He 
805 

Ala Ala Asp Gly 

Ala Arg Arg Ser 
840 

Gly Pro Asn Met 
855 

Ala Val He Phe 
870 

Glu He Ser Leu 
885 

Arg He Met Thr 

Ala Ser Val Leu 
920 

Glu Leu Gly Arg 
935 

Ser Val Phe Arg 
950 

Ala Thr His Tyr 
965 

Val Thr Ser Lys 



Val Leu Lys His 
540 

Leu Arg He Asp 
555 

Ser Cys Asp Gly 
570 

Cys Val Ser Pro 
585 

Pro Leu Lys Asp 

Glu Phe Thr Ala 
620 

His Lys Leu Pro 
635 

Val Arg Ser Ser 
650 

Leu Lys Gin Arg 
665 

Ser Gly He Asp 

Ser Leu Leu Tyr 
700 

Gly Leu Glu Leu 
715 

Phe Pro Asn Tyr 
730 

Leu Thr He Leu 
745 

Glu Val He His 

He Ala Ala Ser 
780 

Pro Glu Ser Glu 
795 

Leu Lys He Gin 
810 

Gin Leu Pro Val 
825 

Ser Gly Ser He 

Gly Gly Lys Ser 
860 

Ala Gin Leu Gly 
875 

Val Asp Thr He 
890 

Gly Glu Ser Thr 
905 

Gin Asn Ala Thr 

Gly Thr Ser Thr 
940 

His Leu Val Glu 
955 

His Pro Leu Thr 
970 

His Met Ala Cys 



Gly Asp He Phe 

Gly Gin Thr Met 
560 

Gly Pro Ser Gly 
575 

Thr Gly Lys Arg 
590 

Val Glu Ser He 
605 

Asn Ser Glu Ser 

Asp Leu Glu Arg 
640 

Ala Ser Val Leu 
655 

Val Lys Ala Phe 
670 

Leu Leu Leu Ala 
685 

Lys Leu Cys Lys 

Phe Leu Ser Gin 
720 

Gin Asn Gin Asp 
735 

He Glu Leu Phe 
750 

Thr He Ser Cys 
765 

Leu Ser Ala Gly 

Ala Thr Asp Gin 
800 

Gly Leu Trp His 
815 

Pro Asn Asp He 
830 

His Pro Arg Ser 
845 

Thr Leu Leu Arg 

Cys Tyr Val Pro 
880 

Phe Thr Arg Leu 
895 

Phe Leu Val Glu 
910 

Gin Asp Ser Leu 
925 

Phe Asp Gly Tyr 

Lys Val Gin Cys 
960 

Lys Glu Phe Ala 
975 

Ala Phe Lys Ser 



980 985 990 

Arg Ser Asp Tyr Gin Pro Arg Gly Cys Asp Gin Asp Leu Val Phe Leu 

995 1000 1005 

Tyr Arg Leu Thr Glu Gly Ala Cys Pro Glu Ser Tyr Gly Leu Gin Val 

1010 1015 1020 

Ala Leu Met Ala Gly He Pro Asn Gin Val Val Glu Thr Ala Ser Gly 
1025 1030 1035 1040 

Ala Ala Gin Ala Met Lys Arg Ser He Gly Glu Asn Phe Lys Ser Ser 

1045 1050 1055 

Glu Leu Arg Ser Glu Phe Ser Ser Leu His Glu Asp Trp Leu Lys Ser 

1060 1065 1070 

Leu Val Gly He Ser Arg Val Ala His Asn Asn Ala Pro He Gly Glu 

1075 1080 1085 

Asp Asp Tyr Asp Thr Leu Phe Cys Leu Trp His Glu He Lys Ser Ser 

1090 1095 1100 

Tyr Cys Val Pro Lys 
1105 



<210> 32 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of ATHGENEA 
microsatellite . 

<400> 32 

accatgcata gcttaaactt cttg 

<210> 33 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of ATHGENEA 
microsatellite . 

<400> 33 

acataaccac aaataggggt gc 

<210> 34 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Forward primer DMCIN-A for PCR on genomic DNA of 
Arabidopsis thaliana ssp. Landsberg erecta "Ler" . 

<400> 34 

gaagcgatat tgttcgtg 



<210> 35 
<211> 18 



<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Reverse primer DMCIN-B for PCR on genomic DNA of 
Arabidopsis thaliana ssp. Landsberg erecta "Ler" 

<400> 35 

agattgcgag aacattcc 

<210> 36 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer DMCIN-1 for PCR on genomic DNA of 
Arabidopsis thaliana ssp. Landsberg erecta "Ler" 

<400> 36 

acgcgtcgac tcagctatga gattactcgt g 

<210> 37 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer DMCIN-2 for PCR on genomic DNA of 
Arabidopsis thaliana ssp. Landsberg erecta "Ler" 

<400> 37 

gctctagatt tctcgctcta agactctct 

<210> 38 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer DMCIN-3 for PCR on genomic DNA of 
Arabidopsis thaliana ssp. Landsberg erecta "Ler" 

<400> 38 

gctctagagc ttctcttaag taagtgattg at 

<210> 39 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer DMCIN-4 for PCR on genomic DNA of 
Arabidopsis thaliana ssp. Landsberg erecta "Ler" 

<400> 39 

tcccccgggc tcgagagatc tccatggttt cttcagctct atgaatcc 



<210> 40 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer DMCla for PCR on genomic DNA of 

Arabidopsis. thaliana ssp. Landsberg erecta "Ler" . 

<400> 40 

acgcgtcgac gaattcgcaa gtgggg 

<210> 41 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer DMClb for PCR on genomic DNA of 

Arabidopsis thaliana ssp. Landsberg erecta "Ler". 

<400> 41 

tccatggaga tctcccgggt accgatttgc ttcgaggg 

<210> 42 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of ATEAT1 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 42 

gccactgcgt gaatgatatg 

<210> 43 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of ATEAT1 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 43 

cgaacagcca acattaattc cc 

<210> 44 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA63 SSLP 
marker in Arabidopsis thaliana subspecies. 



<400> 44 

aaccaaggca cagaagcg 



18 



<210> 45 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA63 SSLP 
marker in Arabidopsis thaliana subspecies. 

<400> 45 

acccaagtga tcgccacc 18 

<210> 46 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA248 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 46 

taccgaacca aaacacaaag g 21 

<210> 47 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA24 8 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 47 

tctgtatctc ggtgaattct cc 22 

<210> 48 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA128 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 48 

ggtctgttga tgtcgtaagt eg 22 

<210> 49 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Reverse primer for PCR amplification of NGA12 8 
SSLP marker in Arabidopsis thaliana subspecies 

<400> 49 

atcttgaaac ctttagggag gg 

<210> 50 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA280 
SSLP marker in Arabidopsis thaliana subspecies 

<400> 50 

ctgatctcac ggacaatagt gc 

<210> 51 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA280 
SSLP marker in Arabidopsis thaliana subspecies 

<400> 51 

ggctccataa aaagtgcacc 

<210> 52 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA111 
SSLP marker in Arabidopsis thaliana subspecies 

<400> 52 

ctccagttgg aagctaaagg g 

<210> 53 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA111 
SSLP marker in Arabidopsis thaliana subspecies 

<400> 53 

tgttttttag gacaaatggc g 

<210> 54 
<211> 20 



<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Forward primer for PCR amplification of NGA16 8 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 54 

ccttcacatc caaaacccac 20 

<210> 55 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA168 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 55 

gcacataccc acaaccagaa 20 

<210> 56 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA112 6 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 56 

cgctacgctt ttcggtaaag 20 

<210> 57 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA1126 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 57 

gcacagtcca agtcacaacc 20 

<210> 58 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Forward primer for PCR amplification of NGA3 61 
SSLP marker in Arabidopsis thaliana subspecies. 



<400> 58 

aaagagatga gaatttggac 
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<210> 59 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA361 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 59 

acatatcaat atattaaagt age 23 

<210> 60 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA168 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 60 

tegtctactg cactgccg 18 

<210> 61 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA168 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 61 

gaggacatgt ataggagect eg 22 

<210> 62 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of AthBI02 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 62 

tgacctcctc ttccatggag 20 

<210> 63 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Reverse primer for PCR amplification of AthBI02 
SSLP marker in Arabidopsis thaliana subspecies. 



<400> 63 

ttaacagaaa cccaaagctt tc 



<210> 64 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of AthUBIQUE 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 64 

aggcaaatgt ccatttcatt g 

<210> 65 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of AthUBIQUE 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 65 

acgacatggc agatttctcc 

<210> 66 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA172 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 66 

agctgcttcc ttatagcgtc c 

<210> 67 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA172 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 67 

catccgaatg ccattgttc 

<210> 68 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Forward primer for PCR amplification of NGA126 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 68 

gaaaaaacgc tactttcgtg g 21 

<210> 69 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA126 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 69 

caagagcaat atcaagagca gc 22 

<210> 70 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA162 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 70 

catgcaattt gcatctgagg 20 

<210> 71 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA162 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 71 

ctctgtcact cttttcctct gg 22 

<210> 72 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Forward primer for PCR amplification of NGA6 SSLP 
marker in Arabidopsis thaliana subspecies. 

<400> 72 

tggatttctt cctctcttca c 21 



<210> 73 
<211> 21 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA6 SSLP 
marker in Arabidopsis thaliana subspecies. 

<400> 73 

atggagaagc ttacactgat c 21 

<210> 74 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA12 SSLP 
marker in Arabidopsis thaliana subspecies. 

<400> 74 

aatgttgtcc tcccctcctc 20 

<210> 75 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA12 SSLP 
marker in Arabidopsis thaliana subspecies. 

<400> 75 

tgatgctctc tgaaacaaga gc 22 

<210> 76 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA8 SSLP 
marker in Arabidopsis thaliana subspecies. 

<400> 76 

gagggcaaat ctttatttcg g 21 

<210> 77 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA8 SSLP 
marker in Arabidopsis thaliana subspecies. 



<400> 77 

tggctttcgt ttataaacat cc 
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<210> 78 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA1107 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 78 

gcgaaaaaac aaaaaaatcc a 

<210> 79 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA1107 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 79 

cgacgaatcg acagaattag g 

<210> 80 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA225 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 80 

gaaatccaaa tcccagagag g 

<210> 81 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA225 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 81 

tctccccact agttttgtgt cc 

<210> 82 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA249 
SSLP marker in Arabidopsis thaliana subspecies. 



<400> 82 

taccgtcaat ttcatcgcc 

<210> 83 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA249 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 83 

ggatccctaa ctgtaaaatc cc 

<210> 84 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of CA72 SSLP 
marker in Arabidopsis thaliana subspecies. 

<400> 84 

aatcccagta accaaacaca ca 

<210> 85 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of CA72 SSLP 
marker in Arabidopsis thaliana subspecies. 

<400> 85 

cccagtctaa ccacgaccac 

<210> 86 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA151 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 86 

gttttgggaa gttttgctgg 

<210> 87 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Reverse primer for PCR amplification of NGA151 
SSLP marker in Arabidopsis thaliana subspecies 

<400> 87 

cagtctaaaa gcgagagtat gatg 

<210> 88 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA106 
SSLP marker in Arabidopsis thaliana subspecies 

<400> 88 

gttatggagt ttctagggca eg 

<210> 89 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA106 
SSLP marker in Arabidopsis thaliana subspecies 

<400> 89 

tgccccattt tgttcttctc 

<210> 90 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA13 9 
SSLP marker in Arabidopsis thaliana subspecies 

<400> 90 

agagctacca gatccgatgg 

<210> 91 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA13 9 
SSLP marker in Arabidopsis thaliana subspecies 

<400> 91 

ggtttcgttt cactatccag g 

<210> 92 
<211> 22 



<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Forward primer for PCR amplification of NGA76 SSLP 
marker in Arabidopsis thaliana subspecies. 

<400> 92 

ggagaaaatg tcactctcca cc 

<210> 93 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA76 SSLP 
marker in Arabidopsis thaliana subspecies. 

<400> 93 

aggcatggga gacatttacg 

<210> 94 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of ATHS0191 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 94 

ctccaccaat catgcaaatg 

<210> 95 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of ATHS0191 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 95 

tgatgttgat ggagatggtc a 

<210> 96 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward primer for PCR amplification of NGA12 9 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 96 

tcaggaggaa ctaaagtgag gg 
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<210> 97 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse primer for PCR amplification of NGA12 9 
SSLP marker in Arabidopsis thaliana subspecies. 

<400> 97 

cacactgaag atggtcttga gg 22 

<210> 98 
<211> 8062 
<212> DNA 

<213> Arabidopsis thaliana ecotype Columbia 
<220> 

<22 3> Genomic DNA sequence of AtMSH6 
<400> 98 

ttttttggtt gctaacaata aaggtatacg gttttatgtc atcaatataa ctatatataa 60 
aagaaatgaa agatatatat tgttttttca tttatcaaac aaaacaacaa gacttttttt 120 
ttacttttta cattggtcaa caaaatacaa gataaacgac atcgtttaat catttcccaa 180 
ttttacccct aagtttaaca cctagaacct tctccatctt cgcaagcaca gcctgattag 240 
gaacagcttt accattctca tattcctgaa ctacctgagt cctctcattg atctgtttcg 300 
ccaaatccgc ttgtgacatc ttcttctcca atctcgcttt ctgtatcatc aacctcacct 360 
ctgctttcac acgatccatc gccgcaggct ctgtttcttc ttccagcttc ttcgtgttaa 420 
tcaccggaac cgccgtagat ttcccctttt tgttcgaacc ggcatcgaat ttcttaaccg 480 
tttgaaccgc gacaccgttt ctcagagctg cgttaaccgc tttcggatcg cgtaggtctt 540 
ggctcttttg ttttgatttg tggagaacta ctggttccca gtcttgtgtt actgctcctg 600 
ggtatctgct cggcatcgtc gatgaattga gagaaaggaa caacgcgaaa attttattaa 660 
tctgagtttt gaaattgaga aacgatgaag atgaagaatg ttgttgagag gattgtgata 720 
tttatatata cgaagattgg tttctggaga attcgatcat ctttttctcc attttcgtct 780 
ctggaacgtt cttagagatg attgacgacg tgtcattatc tgatttgcag ttaaccaatg 840 
ctttttgggt tggattcgtg gtacaccata ttatccgatt tggctcaatg gttttatata 900 
aatttggttt tcggttcggt tatgagttat cattaaaatt aagctaacca aaaattttcg 960 
taaaatttat ttcggtttca attcggatcc cttacttcca gaaccgaatt attcgaaacc 1020 
ggggttagcc gaaccgaata ccaatgcctg attgactcgt tggctagaaa gatccaacgg 1080 
tatacaataa tagaacataa atcggacggt catcaaagcc tcaaagagtg aacagtcaac 114 0 
aaaaaaagtt gagccctgag gagtatcgtt tccgccattt ctacgacgca aggcgaaaat 1200 
ttttggcgcc aatctttccc ccctttcgaa ttctctcagc tcaaaacatc gtttctctct 1260 
cactctctct cacaattcca aaaaatgcag cgccagagat cgattttgtc tttcttccaa 1320 
aaacccacgg cggcgactac gaagggtttg gtttccggcg atgctgctag cggcgggggc 1380 
ggcagcggag accacgattt aatgtgaagg aaggggatgc taaaggcgac gcttctgtac 144 0 
gttttgctgt ttcgaaatct gtcgatgagg ttagaggaac ggatactcca ccggagaagg 1500 
ttccgcgtcg tgtcctgccg tctggattta agccggctga atccgccggt gatgcttcgt 1560 
ccctgttctc caatattatg cataagtttg taaaagtcga tgatcgagat tgttctggag 1620 
agaggtacta atcttcgatt ctcttaattt tgttatcttt agctggaaga agaagattcg 1680 
tgtaatttgt tgtattcgtt ggagagattc tgattactgc attggatcgt tgtttacaaa 1740 
ttttcaggag ccgagaagat gttgttccgc tgaatgattc atctctatgt atgaaggcta 1800 
atgatgttat tcctcaattt cgttccaata atggtaaaac tcaagaaaga aaccatgctt 1860 
ttagtttcag tgggagagct gaacttagat cagtagaaga tataggagta gatggcgatg 192 0 
ttcctggtcc agaaacacca gggatgcgtc cacgtgcttc tcgcttgaag cgagttctgg 1980 
aggatgaaat gacttttaag gaggataagg ttcctgtatt ggactctaac aaaaggctga 2040 
aaatgctcca ggatccggtt tgtggagaga agaaagaagt aaacgaagga accaaatttg 2100 
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aatggcttga gtcttctcga atcagggatg 
acgatagaaa gaccttacac ataccacctg 
agcaatattg gagtgttaag agtgaatata 
gtaactatta atctagtgtt caatccattt 
ttacgttatg ctcttctcag gggaaatttt 
gtcacaagga gcttgactgg aagatgacca 
attagttgaa acaactggcc tgcttgaatt 
tgtttcaggt tggtatctct gaaagtggga 
gtgggtaagg gaaccatcat actttatgga 
aagaaatgga aatcacttca agcatcatta 
ttattcgtta tataataagt cttttcatca 
tgtgtgctca ctggcaaggc atatataccc 
agttacggat aatatatgtt tacttgtatg 
tttcaagcct tgtgttgaaa ctggatagtt 
gattcatatg ctttcctatt ctactaatta 
agatataaag ttggacgaat cgagcagcta 
gctaatactg taagttttct tggataggtc 
tttctttttc tgtacattac tttcatgctg 
tatcagataa ttccaaggaa gctagttcag 
aacatcgggc ctgatgccgt ccatcttctt 
tatcttatca tgttcagttc atccaagtcc 
ccatcaagct gtgtaaagga tttggaatta 
tgcaagaggt tcccttgaaa agatctgttt 
ttttaagtga atcccctact ttcttacaat 
tatcattcct tgttactgca tccccctctt 
tctcgtgtgc tggagaaaat gatagctgat 
tgctcaggaa ttgcctttgg ttacattgcc 
tctttatttt agatcaaaat ggagctacaa 
gttgactgtg ctgccttgag gttttgggtt 
gctcttggag cgttattgat gcaggtaagc 
tgacttcctg tgcatatatt tgggttgcag 
tttttccagg tttctccaaa ggaagtgtta 
ccagttgttt tgttaaacag aatttaaggt 
atgttgaaat attgcagggc tatcaagaga 
gacaggtacc atttcagtag gcaagctaac 
tctcttaaac attgctaatg tagatgatgt 
gtacagttgg ctccagtacc acaagtaatg 
ataatagaat ctaacggata ctttaaaggt 
ggtctaaatg aatgtgatgt tgcccttagt 
aggctaaagg tgtgttggct tgtttagttt 
ttcataactt acagtttcta tctacttgca 
ttttccatac caagtttaca ggggttgtct 
tgagatattt aacaatagct gtgatggtgg 
ataacttcaa ctagagggca gacatagaag 
agtaagtaat agccgatttt tgtttaccta 
ctgtgttagt ccaactggta agcgactctt 
tgtagaaagc atcaataaac ggcttgatgt 
tatgcaaatc actggccagt atctccacaa 
catcaagtct agcgttcgat catcagcctc 
gctgaaacaa cgagtaagta tcaatcacaa 
ataggactaa aacattacgg gtctagctaa 
ttattcatta catttctctt aacttattgc 
gaaagggttc agaagtggaa ttgatctgtt 
gagtttgctt tataaactct gtaaacttcc 
atttctttct caattcgaag cagccataga 
ctatctttca tactttacaa caaaatgtct 
gatctcaact cacaccccga ggatcctaaa 



ccaatagaag acgtcctgat gatccccttt 2160 
atgttttcaa gaaaatgtct gcatcacaaa 2220 
tggacattgt gcttttcttt aaagtggtta 2280 
cctcaatgtg atttgttcac ttacatctgt 2340 
atgagctgta tgagctagat gcggaattag 24 00 
tgagtggtgt gggaaaatgc agacaggtaa 24 60 
attgtgtcta taaattttga caccaccttt 2520 
tagatgaggc agtgcaaaag ctattagctc 2580 
attcgtttac tgctacttcg gctaggattt 2640 
gttaggatcc tgagaactca ggatgttttc 2700 
aggagtaaca aacaaaactt gcacaatatt 2760 
agctaacctt tgctagttca ctgtagtaac 2820 
tggtaccctc attttgtctc tcatggaggc 2880 
acatatgctt ccaacagaaa ctagcatgca 2940 
tgtattgaca cactcgttgt ttcttttgaa 3000 
gaaacatctg accaagcaaa agccagaggt 3060 
aaggagagtg ttgcagactg tttttgatca 312 0 
taattaactc aatggctatt ctggtctgat 3180 
gtattaactc catcaacagc aagcgaggga 3240 
gctataaaag aggtttgtta tttacttatt 3300 
tgaaaaatta cactcttctt taccaatctt 3360 
gaaaatcatt atttgatgct ttgttttata 3420 
aagattcttt gcacttgaaa aattcaatct 3480 
gatcatagtc tgcaattgca tgtcaagtaa 3540 
tcttaatgac cattgtctat gttgtgtttg 3600 
ccaagctgta cattatcatg attaagtagc 3660 
taatggtttg atgtcaattt ttcttctgaa 3720 
aagtgttcaa ctgtgtatgg atttgctttt 3780 
gggtccatca gcgatgatgc atcatgtgct 3840 
aagtgtattc tgtatcttat gtgtaccatg 3900 
gaactaattc tgaatcacca tttggtatgt 3960 
tatgacagta aaggtaaact gcttgtatcg 402 0 
aaatgacact ggttaattta aagtgcatac 4080 
agcacaaaag gctctaagga aatatacgtt 4140 
tgacaattta accgctcacc gaatgatagg 4200 
ttatgtttca atctaatagg gtctacggcg 4260 
ggggatacag atgctgctgg agttagaaat 4 32 0 
tcttctgaat catggaactg tgctgttgat 4380 
gctcttggag agctaattaa tcatctgtct 4440 
ttgcttttca caaattaagc aaaggaactt 4500 
gctagaagat gtacttaagc atggggatat 4560 
cagaattgat ggccagacga tggtaaatct 4620 
tccttcaggc aagtgcatat ttcttttttg 4680 
gaaaaattct aatacttcgt acggatctcc 4740 
tgtagggacc ttgtacaaat atcttgataa 4800 
aaggaattgg atctgccatc cactcaaaga 4860 
agttgaagaa ttcacggcaa actcagaaag 4 920 
acttccagac ttagaaagac tgctcggacg 4980 
tgtgttgcct gctcttctgg ggaaaaaagt 5040 
gttttctgag taatgccttc catgagtagt 5100 
agactgttct ccttcttttg caatgtctgg 5160 
attgcaggtt aaagcatttg ggcaaattgt 5220 
gttggctcta cagaaggaat caaatatgat 52 80 
tatattagta ggaaaaagcg ggctagagtt 5340 
tagcgacttt ccaaattatc aggtgcccat 5400 
gtcactactc aaagcaatgc atatggctta 5460 
gggatttgct ttttattcct aatgtttttg 5520 



gatggtttga tttatttcta acttgaactt attaatcttg taccagaacc aagatgtgac 5580 
agatgaaaac gctgaaactc tcacaatact tatcgaactt tttatcgaaa gagcaactca 5640 
atggtctgag gtcattcaca ccataagctg cctagatgtc ctgagatctt ttgcaatcgc 5700 
agcaagtctc tctgctggaa gcatggccag gcctgttatt tttcccgaat cagaagctac 5760 
agatcagaat cagaaaacaa aagggccaat acttaaaatc caaggactat ggcatccatt 5820 
tgcagttgca gccgatggtc aattgcctgt tccgaatgat atactccttg gcgaggctag 5880 
aagaagcagt ggcagcattc atcctcggtc attgttactg acgggaccaa acatgggcgg 5940 
aaaatcaact cttcttcgtg caacatgtct ggccgttatc tttgcccaag tttgtatact 6000 
cgttagataa ttactctatt ctttgcaatc agttcttcaa catgaataat aaattctgtt 6060 
ttctgtctgc agcttggctg ctacgtgccg tgtgagtctt gcgaaatctc cctcgtggat 6120 
actatcttca caaggcttgg cgcatctgat agaatcatga caggagagag taagttttgt 6180 
tctcaaaata ccaattcctc gaactattta ctcagatttt gtctgattgg acaaggtggt 6240 
tttgcttttt tttaggtacc tttttggtag aatgcactga gacagcgtca gttcttcaga 6300 
atgcaactca ggattcacta gtaatccttg acgaactggg cagaggaact agtactttcg 6360 
atggatacgc cattgcatac tcggtaacct gctcttctcc ttcaacttat acttgttgat 6420 
caacaaaaac atgcaattca ttttgctgaa acttattgat ttatatcagg tttttcgtca 6480 
cctggtagag aaagttcaat gtcggatgct ctttgcaaca cattaccacc ctctcaccaa 6540 
ggaattcgcg tctcacccac gtgtcacctc gaaacacatg gcttgcgcat tcaaatcaag 6600 
atctgattat caaccacgtg gttgtgatca agacctagtg ttcttgtacc gtttaaccga 6660 
gggagcttgt cctgagagct acggacttca agtggcactc atggctggaa taccaaacca 672 0 
agtggttgaa acagcatcag gtgctgctca agccatgaag agatcaattg gggaaaactt 67 80 
caagtcaagt gagctaagat ctgagttctc aagtctgcat gaagactggc tcaagtcatt 6840 
ggtgggtatt tctcgagtcg cccacaacaa tgcccccatt ggcgaagatg actacgacac 6900 
tttgttttgc ttatggcatg agatcaaatc ctcttactgt gttcccaaat aaatggctat 6960 
gacataacac tatctgaagc tcgttaagtc ttttgcttct ctgatgttta ttcctcttaa 7020 
aaaatgctta tatatcaaaa aattgtttcc tcgattataa caagattata tatgtatctg 7080 
tcggtttagc tatggtatat aatatatgta tgttcatgag attggtcaag agaaatactc 714 0 
acaaacagta tattaagaag gaaatatgtt tatgcattaa tttaagtttc aagataaact 7200 
gcaaataacc tcgactaaag ttgcaaagac caaacacaaa ttacaaaact tataagactt 7260 
aagttctgaa ttccctaaaa ccaaaaaaaa aaacagaaca tattttgttg catctacaaa 7320 
caacacaaac ctacatagtt tataacttac tcatcactga gattaacatc agaatcattc 7380 
tccatttctt catcttcact ctcatcatca tcaccaccac catgatgatt ctcctcctct 7440 
tcacgtaacc tagcaatctc actctgagct ctatcaacaa tctgcttctt ctgcaactcc 7500 
aaatctctct gaaaatcagc tctcatcttc tccaactcct tcatttgctc tttcttactc 7560 
ttctccatct tctcataaac cttcccaaac ctctcaacag aatccgccaa catcttatac 7620 
gaagcagcgt cattaacctt cttcctctcg tactcaacct catcatcctc atcctcctcc 7680 
tcttcagaat caccaggact atccatcatc tcatcaaacc cattagactt atctaaataa 7740 
accttagtgt tcataaacac aaactcacct gaatcaacac cacaagctaa acctaaatcc 7800 
gacttgggcg aaacacaaag caacatatcc aacttattga aaaacgacca tttacttgaa 7860 
cctaaacctg atttctcaac cttaatcttc tcttttctat acttcctctt caagtcatca 7920 
atcattctcc tacattgcgt ctcagatttc tccatcctta gctcctcact cactttctca 7980 
gctacttcat tccaatcctc gttcctcaaa ctccttctac ccaattgcaa aaacctatct 8040 
ccccaaactt caagcaacac aa 8062 



<210> 99 
<211> 1047 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 99 

Met Val lie Gly Asn Glu Pro Lys Leu Val Leu Leu Arg Ala Lys Ser 

15 10 15 

Ser Ala Asn Arg Phe lie Leu Leu Asn Leu Leu Thr lie Met Ala Gly 

20 25 30 

Gin Pro Thr He Ser Arg Phe Phe Lys Lys Ala Val Lys Ser Glu Leu 



35 

Thr His Lys Gin 
50 

Glu Ser He Cys 
65 

Ala Ser Thr Thr 

Ser Ser Lys Asn 
100 

Thr Phe Asn Asp 
115 

Arg Arg Ser Asp 
130 

Glu Glu Asp Phe 
145 

Leu Thr Pro Leu 

Asp Lys Val Leu 
180 

Glu Asp Ala Val 
195 

Gly Lys Leu Thr 
210 

Gin Phe Ala Tyr 
225 

Glu Arg Leu Val 

Glu Thr Ser Ala 
260 

Val Phe Glu Arg 
275 

Val Asn Ser Thr 
290 

Asn Ser He Trp 
305 

Lys Tyr Ser Leu 

Asp Glu Phe Glu 
340 

He Lys Tyr Leu 
355 

Pro Leu His Val 
370 

His Lys Gin Glu 
385 

Val Met Asn Glu 

Ser Lys Leu Tyr 
420 

Leu He Pro Ser 
435 

Leu Asp Pro Asn 
450 

Gly Lys Gly Ser 
465 

Gly Leu Arg Met 



40 

Glu Gin Glu Val 
55 

Leu Asp Thr Asp 
70 

Val Thr Asn Asp 
85 

Ser Lys Asn Ser 

He Asp Phe Ala 
120 

Glu Asn Val Glu 
135 

Val Lys Lys Lys 
150 

Asp Lys Gin Val 
165 

Val He Arg Val 

Thr Val Ser Arg 
200 

He Asp Glu Ser 
215 

Cys Ser Phe Pro 
230 

His His Asn Leu 
245 

He Lys Lys His 

Lys He Ser Asn 
280 

Phe Val Leu Arg 
295 

Ala Leu Ser Arg 
310 

He Ser Val Asn 
325 

Glu Pro Asn Leu 

Gin Pro He Glu 
360 

Ala Lys Phe Phe 
375 

Tyr Asp Leu Glu 
390 

Lys He Gin Leu 
405 

Ser His Met Val 

He Tyr Ser Pro 
440 

Ser Leu Gin Ser 
455 

Leu Phe Trp Leu 
470 

Leu Arg Glu Trp 
485 



Ala Val Gly Asn 
60 

Glu Glu Asp Asn 
75 

Ser Phe Pro Leu 
90 

Glu Lys Thr Ser 
105 

Lys Lys Leu Asp 

Ala Glu Asp Asp 
140 

Ala Arg Lys Ser 
155 

Lys Asp Leu Lys 
170 

Gly Tyr Lys Tyr 
185 

He Leu His He 

Asn Pro Gin Asp 
220 

Asp Val Arg Leu 
235 

Lys Val Ala Val 
250 

Asp Pro Gly Ala 
265 

Val Phe Thr Lys 

Gly Lys Arg He 
300 

Asp Val His Gin 
315 

Leu Asn Asn Gly 
330 

Ala Asp Glu Lys 
345 

Val Leu Val Asn 

Lys Asp He Ser 
380 

Asp His Val Val 
395 

Ser Pro Ser Leu 
410 

Glu Tyr Asn Asn 
425 

Phe Ala Ser Lys 

Leu Asp He Phe 
460 

Leu Asp His Thr 
475 

He Leu Lys Pro 
490 



45 

Gly Ala Gly Ser 

Leu Ser Ser Val 
80 

Lys Gly Ser Val 
95 

Gly Thr Ser Thr 
110 

Arg He Met Lys 
125 

Glu Glu Glu Gly 

Pro Thr Ala Lys 
160 

Met His His Arg 
175 

Lys Cys Phe Ala 
190 

Lys Leu Val Pro 
205 

Cys Asn His Arg 

Asn Val His Leu 
240 

Val Glu Gin Ala 
255 

Ser Lys Ser Ser 
270 

Ala Thr Phe Gly 
285 

Leu Gly Asp Thr 

Gly Lys Val Ala 
320 

Glu Val Val Tyr 
335 

Leu Gin He Arg 
350 

Thr Asp Asp Leu 
365 

Cys Pro Leu He 

Gin Ala He Lys 
400 

He Arg Leu Val 
415 

Glu Gin Val Met 
430 

He His Met Leu 
445 

Thr His Asp Gly 

Arg Thr Ser Phe 
480 

Leu Val Asp Val 
495 



1 



ill o 


Olll 


Tie 

IXC 




Glu 

VJ1U 




Leu 


Asp 


Ala 


He 


Glu 


Cvs 


He 


Thr 


Ser 


Glu 








5 00 

.J \J w 










505 










510 






Tip 


Aon 
ASH 


no 11 


Q fa y- 
OC X 


Tip 


Php 

JT11C 


Phe 

C 11G 


Glu 


Ser 


Leu 


Asn 


Gin 


Met 


Leu 


Asn 


His 






515 










520 










525 








Thr 


rl U 


Aon 

Ab LJ 


T .pi i 

XJC LI 


TiPii 

XJC LI 




Thr 


Leu 


Asn 


Ara 

■fix. y 


He 


Met 


Tyr 


Gly 


Thr 


Thr 














535 










540 












Arg 


Xjy b 


UJ.U 


Val 


Tvr 


Phe 


Tvr 

±y x 


Leu 


Lvs 


Gin 


He 


Thr 


Ser 


Phe 


Val 












550 










555 










560 




TJ-i e 
til b 


Php 

JriJ.C 


T iVQ 


nc l. 


Hi r 

111 0 


Gin 

VJ X J J- 


Ser 


Tvr 

xy x 


Leu 


Ser 


Glu 


His 


Phe 


Lys 


Ser 








565 

J \J ,J 










570 










575 




O A T* 

OCX 






A 7-rr 

Arg 


Hp 

11C 


Gl v 
vjx y 


jjy o 


Gin 

WX J-l 


Ser 


Pro 


Leu 


Leu 


Phe 


Arq 


Leu 


Phe 








580 

J O V 










585 










590 






OCi 


ox u. 


T .01 1 
XjC u. 


& en 


Gl n 

VJX Li. 


TiPil 

JJwU 


Leu 


Ser 


Thr 


Thr 


Gin 


Leu 


Pro 


His 


Phe 


Leu 






595 
-j j -> 










600 










605 








1 111 


M af- 
rit; l. 


Tip 
x x c 




Val 

V d X 




Ala 


Val 


Met 


Glu 


Lys 


Asn 


Ser 


Asp 


Lys 


Gin 




610 










615 










620 












i*ic u. 




Php 


Phe 


Asn 


Leu 


Asn 


Asn 


Tvr 


Asd 


Cvs 


Ser 


Glu 


Gly 


He 


625 










630 










635 










640 


Tl P 

X X c 


xi y o 


lie 

X X W 


Gin 

11 


Arg 


Glu 


Ser 


Glu 


Ser 


Val 


Ara 


Ser 


Gin 


Leu 


Lys 


Glu 










645 










650 










655 




\J1 u 


T.PH 


Ala 


Glu 

\J X LI 


He 

X X C 


Ara 

«.x,y 


Lys 


Tvr 


Leu 


Lys 


Arq 


Pro 


Tyr 


Leu 


Asn 


Phe 








660 










665 










670 






7\ v-rt 

my 


Ben 
Ab^ 




1 

V CL X 




iyr 


T iPi i 

J-IC Ll 


He 

X 1 w 


Glu 


Val 


Lys 


Asn 


Ser 


Gin 


He 


Lys 






675 










680 










685 








Abp 


J_ic u 


Ir 1 \J 


A QTl 


nop 


irp 


lie 

X X c 




Val 


Asn 


Asn 


Thr 


Lys 


Met 


Val 


Ser 




690 










695 










700 










A -k-(-t 

Arg 


php 
jrlic 


Thr 


Thr 


Pro 

JTX \J 


Arg 


Th T" 


Gin 


Lys 


Leu 


Thr 


Gin 


Lvs 


Leu 


Glu 


Tvr 


705 










710 










715 










720 


iyr 


xi y o 


Sr 


T.Pii 
J—l c. li 




He 

J- -X w 


A rn 


Glu 


Ser 


Glu 


Leu 


Gin 


Tvr 

i j - 


Lvs 


Glu 


Phe 










725 










730 










735 




TiP 13 




xjy o 


He 


Thr 

X -l J. X 


Ala 


Glu 


Tvr 


Thr 


Glu 


Leu 


Ara 


Lvs 


lie 


Thr 


Leu 








740 










745 










750 






Ben 


XJC u. 


Al 3 
AX Ct 


m n 

VJJ.X1 






^y& 


He 

X Xw 




Ser 


Leu 


Ala 


Ala 


Thr 


Ser 


Cvs 






755 










760 










765 








S es n 


\r* 1 


Sen 


iyr 


v d X 


Arg 


□ rn 


J. 11X 


Php 


Val 

V CI X 


A°T1 
no x i 


Glv 

\j x y 


Gin 

w X. XL 


Gin 


Ala 


lie 




77 n 










775 










780 










Tip 
1 1 C 


Ala 


xjy o 


Son 

noli 


Ala 

rtl d 


Arg 


A QTl 


t J-U 


He 

X X c 


He 

X X 


Glu 

W X XX 


Ser 


Leu 


Asp 


Val 


His 


7 










7 90 










795 










800 


iyr 


Veil 


Jrx U 


A q rl 


Aop 


Tip 

11C 




MPt" 

1*1 C L. 


Cor 

OCX 


Pro 

IT X w 


Glu 

W X Ll 


Asn 

"Oil 


Glv 

vjxy 


Lys 


lie 


Asn 










805 










810 










815 




lie 

X X c 


He 

X X C 


Thr 

X -L1X 


Glv 
vjiy 


Pro 

^ X v 


A e n 


Met 


Glv 


Glv 


Lys 


Ser 


Ser 


Tvr 


He 


Arq 


Gin 








820 

U £i \J 










825 










830 






V ell 


Al a 


XJC IX 


XJC LI 


X 11X 


Tip 

X X c 


1 1C 1— 


Ala 


Gl n 

w X 11 


He 

X X W 


Gly 


Ser 


Phe 


Val 


Pro 


Ala 






835 










840 










845 










VJ1U 


T1p 
x x c 


nx y 


T i 

UC LI 


^PT 
OCX 


Tie 

X xc 


Php 

IT 11^ 


Glu 


Asn 


Val 


Leu 


Thr 


Arg 


He 


Glv 




85n 










855 

O <J ^ 










860 










Al d 


ni s 


A C!t"\ 
nop 


A cn 
Mop 


Tip 
lie 


Tl p 

X X c 


noil 


Gl v 
vjx y 


A cin 


OCX 


Thr 

llll 


Phe 


Lys 


Val 


Glu 


Met 


D U -> 










670 










875 










880 


XjC U. 


Abp 


Tl fa 

lie 


Leu 


ill b 


T1p 
lie 


XjC Ll 


xjy o 


A en 


y o 


Aqri 
rlo 11 


xjy o 




RpT 

OCX 


Leu 


Leu 










885 










890 










895 




Leu 


Leu 


Asp 


Glu 


Val 


Gly 


Arg 


Gly 


Thr 


Gly 


Thr 


His 


Asp 


Gly 


He 


Ala 








900 










905 










910 






He 


Ser 


Tyr 


Ala 


Leu 


lie 


Lys 


Tyr 


Phe 


Ser 


Glu 


Leu 


Ser 


Asp 


Cys 


Pro 






915 










920 










925 








Leu 


He 


Leu 


Phe 


Thr 


Thr 


His 


Phe 


Pro 


Met 


Leu 


Gly 


Glu 


He 


Lys 


Ser 




930 










935 










94 0 










Pro 


Leu 


He 


Arg 


Asn 


Tyr 


His 


Met 


Asp 


Tyr 


Val 


Glu 


Glu 


Gin 


Lys 


Thr 



1 * 



945 950 955 960 

Gly Glu Asp Trp Met Ser Val He Phe Leu Tyr Lys Leu Lys Lys Gly 

965 970 975 

Leu Thr Tyr Asn Ser Tyr Gly Met Asn Val Ala Lys Leu Ala Arg Leu 

980 985 990 

Asp Lys Asp He He Asn Arg Ala Phe Ser He Ser Glu Glu Leu Arg 

995 1000 1005 

Lys Glu Ser He Asn Glu Asp Ala Leu Lys Leu Phe Ser Ser Leu Lys 

1010 1015 1020 

Arg He Leu Lys Ser Asp Asn He Thr Ala Thr Asp Lys Leu Ala Lys 
1025 1030 1035 1040 

Leu Leu Ser Leu Asp He His 
1045 



<210> 100 
<211> 1242 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 100 

Met Ala Pro Ala Thr Pro Lys Thr Ser Lys Thr Ala His Phe Glu Asn 

15 10 15 

Gly Ser Thr Ser Ser Gin Lys Lys Met Lys Gin Ser Ser Leu Leu Ser 

20 25 30 

Phe Phe Ser Lys Gin Val Pro Ser Gly Thr Pro Ser Lys Lys Val Gin 

35 40 45 

Lys Pro Thr Pro Ala Thr Leu Glu Asn Thr Ala Thr Asp Lys He Thr 

50 55 60 

Lys Asn Pro Gin Gly Gly Lys Thr Gly Lys Leu Phe Val Asp Val Asp 
65 70 75 80 

Glu Asp Asn Asp Leu Thr He Ala Glu Glu Thr Val Ser Thr Val Arg 

85 90 95 

Ser Asp He Met His Ser Gin Glu Pro Gin Ser Asp Thr Met Leu Asn 

100 105 110 

Ser Asn Thr Thr Glu Pro Lys Ser Thr Thr Thr Asp Glu Asp Leu Ser 

115 120 125 

Ser Ser Gin Ser Arg Arg Asn His Lys Arg Arg Val Asn Tyr Ala Glu 

130 135 140 

Ser Asp Asp Asp Asp Ser Asp Thr Thr Phe Thr Ala Lys Arg Lys Lys 
145 150 155 160 

Gly Lys Val Val Asp Ser Glu Ser Asp Glu Asp Glu Tyr Leu Pro Asp 

165 170 175 

Lys Asn Asp Gly Asp Glu Asp Asp Asp He Ala Asp Asp Lys Glu Asp 

180 185 190 

He Lys Gly Glu Leu Ala Glu Asp Ser Gly Asp Asp Asp Asp Leu He 

195 200 205 

Ser Leu Ala Glu Thr Thr Ser Lys Lys Lys Phe Ser Tyr Asn Thr Ser 

210 215 220 

His Ser Ser Ser Pro Phe Thr Arg Asn He Ser Arg Asp Asn Ser Lys 
225 230 235 240 

Lys Lys Ser Arg Pro Asn Gin Ala Pro Ser Arg Ser Tyr Asn Pro Ser 

245 250 255 

His Ser Gin Pro Ser Ala Thr Ser Lys Ser Ser Lys Phe Asn Lys Gin 

260 265 270 

Asn Glu Glu Arg Tyr Gin Trp Leu Val Asp Glu Arg Asp Ala Gin Arg 
275 280 285 



■ 



A T"0 


Dm 


xjy o 


Cpr 

uC J. 


Asn 


Pro 


Glu 


Tvr 

±y x 


Asp 


Pro 


Ara 


Thr 


Leu 


Tyr 


He 


Pro 




290 










295 










300 










Cov 
OCJ. 


C <=i >- 

OCX 


Ala 




Asn 


Lys 


Phe 


Thr 


Pro 


Phe 


Glu 


Lys 


Gin 


Tyr 


Trp 


Glu 


J -J 










310 










315 










320 


Tie 

X X c 


T,VC! 

xjy s 


c -i- 




Met 


x x p 


Asp 


Cvs 


He 


Val 


Phe 


Phe 


Lys 


Lys 


Gly Lys 










325 










330 










335 




JrilC 


Jriic 


ul Li 


XJC Li 


Tvt 
± y x. 


Glu 


Lys 


Asp 


Ala 


Leu 


Leu 


Ala 


Asn 


Ala 


Leu 


Phe 








340 










345 










350 








XJC LL 


T,VQ 


Tie 

X X c 


Ala 

ni, Cl 


G~\ v 


Glv 

«xy 


Glv 

vjxy 


Arg 


Ala 


Asn 


Met 


Gin 


Leu 


Ala 


Gly 






355 










360 










365 








Tie 

_L A. C 


Pro 


Glu 


Met 


Ser 


Phe 


Glu 


Tvr 

4.y x 


TrD 

X X£/ 


Ala 


Ala 


Gin 


Phe 


He 


Gin 


Met 




3 70 










375 










380 










m v 

uxy 


TVT 

iyx 


xj j o 


Val 


Ala 


Lys 


Val 


Asp 


Gin 


Arcr 


Glu 


Ser 


Met 


Leu 


Ala 


Lys 


385 










390 










395 










400 


Glu 


Met 


jm.x y 


Glu 


Glv 


Ser 


Lys 


Glv 


He 


Val 


Lvs 


Arq 


Glu 


Leu 


Gin 


Cys 










405 










410 










415 




lie 

x x c 


Leu 


Thr 


Ser 


Glv 


Thr 


Leu 


Thr 


Asp 


Glv 


Asp 


Met 


Leu 


His 


Ser Asp 








420 










425 










430 






T .en i 

XJC Li 


Al a 


Th-r 

1 ilX 


Php 

JL7 1 1C 


Ly D 


XJC Li 


Ala 


He 


Arcr 


Glu 


Glu 


Pro 


Glv 


Asn 


Phe 


Tvr 






435 

-J %j 










440 










445 








M Oil 


Glu 


Thr 


Gin 


Leu 


Asp 


Ser 


Ser 


Thr 


He 


Val 


Gin 


Lvs 


Leu 


Asn 


Thr 




450 










455 










460 










xj y o 


lie 

X J- w 


Phe 


Gly 


Ala 


Ala 


Phe 


He 


Asp 


Thr 


Ala 


Thr 


Glv 

JL 


Glu 


Leu 


Gin 


465 










470 










475 










480 


Met 


Leu 


Glu 


Phe 


Glu 


Asp 


Asp 


Ser 


Glu 


Cvs 


Thr 


Lvs 
i " 


Leu 


Asp 


Thr 

A. Ill 


Leu 










485 










490 










** J J 




Met 

l* 




Gin 

\J X il 


Val 


Arcr 


Pro 


Met 


Glu 


Val 


Val 


Met 


Glu 


Arq 


Asn 




Tien 

XJC Li. 








500 










505 










510 






OCX 


X 1-1X 


XJC Li 


Al a 




xj y o 


He 

-JL -L 


Val 


Lys 


Phe 


Asn 


Ser 


Ala 


Pro 


ten 


Ala 






515 










520 










525 








lie 


Phe 

jCT J-i 


Asn 


Glu 


Val 


Lys 


Ala 


Glv 


Glu 


Glu 


Phe 


Tvr 


Asp 


Cvs 


no \J 


T ,\/c 

xjy o 




530 










535 










540 










Thr 




Ala 


Glu 


He 


He 


Ser 


Ser 


Glu 


Tvr 


Phe 


Ser 


Thr 


Glu 


Glu 


Asp 


545 










550 










555 










560 


Tro 


Pro 


Glu 


Val 


Leu 


Lys 


Ser 


Tvr 


Tvr 


Asp 


Thr 


Glv 


Lvs 


Lvs 


Val 


Gly 










565 










570 










575 




Phe 


Set* 


Ala 


Phe 


Glv 


Glv 


Leu 


Leu 


Tvr 


Tvr 


Leu 


Lvs 


Trp 


Leu 


Lys 


Leu 








580 










585 










590 






A«?n 

riij £V 


XJjr O 


Asn 


Leii 

J— 1 Li 


He 


Ser 


Met 


Lys 


Asn 


He 


Lys 


Glu 


Tvr 


Asp 


Phe 


Val 






595 










600 










605 








XJjr 


Ser 


Gin 


His 


Ser 


Met 


Val 


Leu 


Asp 


Glv 

vjxy 


He 


Thr 


Leu 


Gin 


Asn 


Leu 




610 










615 










620 










m n 

U-LU 


Tie 

X X c 


JriiC 


OCX 


Sen 

no ii 


O C X 


Phe 

IT 11C 




Gl v 
oxy 


ocx 


Asp 


Lys 


Gly 


Thr 


Leu 


Phe 


625 

V Ai -J 










630 










635 










640 


xj y o 


XJC Li 


Phe 

Jr lie 


A ^Tl 




Ala 


He 

A- -L C 


Thr 


Pro 

JT X 


Met 


Gly 


Lys 


Arg 


Met 


Met 


Lys 










645 










650 










655 




T iVcj 
xj_y o 


±x,p 


XJC Li 


i 1C L* 






T iPi l 

XJC LA. 


XJC Li 


A TO 


TiVR 
J— J j o 


Asn 


Asp 


He 


Glu 


Ser 


Arg 








660 

\J \j \J 










665 










670 






XJC Ll 




O ex 


Val 

vox 




Car 

OCX 




Tien 

XJC Li 


Gin 




He 


Thr 


Leu 


Ara 

«.x y 


Glu 


Gin 






675 










680 










685 








Leu 


Glu 


He 


Thr 


Phe 


Ser 


Lys 


Leu 


Pro 


Asp 


Leu 


Glu 


Arg 


Met 


Leu 


Ala 




690 










695 










700 










Arg 


lie 


His 


Ser 


Arg 


Thr 


He 


Lys 


Val 


Lys 


Asp 


Phe 


Glu 


Lys 


Val 


He 


705 










710 










715 










720 


Thr 


Ala 


Phe 


Glu 


Thr 


He 


He 


Glu 


Leu 


Gin 


ASp 


Ser 


Leu 


Lys 


Asn 


Asn 










725 










730 










735 




Asp 


Leu 


Lys 


Gly 


Asp 


Val 


Ser 


Lys 


Tyr 


He 


Ser 


Ser 


Phe 


Pro 


Glu Gly 










740 










745 










750 






Leu 


Val 


755 


Al a 


Val 

V 0. -L 


uy o 


OCX 


i i.y 
7 60 


Thr 


Asn 


Ala 


Phe 


Glu 
765 


Ara 


Gin 


Lys 


Ala 


He 
770 


n.911 


UlU 


Moil 


He 

X X w 


lie 
775 


Val 


Pro 


Gin 


Ara 


Glv 
780 


Phe 


Asp 


lie 


Glu 


Phe Asp 


TjVS 


^ ^— ±- 


Met 


Asp 


Ara 


He 


Gin 


Glu 


Leu 


Glu 


Asp 


Glu 


Leu 


Met 


785 










790 










795 










800 


Glu 


He 


ucu 


Met 


Thr 
805 


Tvr 


Ara 


Lys 


Gin 


Phe 
810 


Lvs 


Cvs 


Ser 


Asn 


He 
815 


Gin 


Tyr 


Lys 




820 




XJy o 


Glu 


He 


Tvr 
825 


Thr 


He 


Glu 


He 


Pro 
830 


He 


Ser 


Ala 


Thr 


835 




Val 


Pro 


Ser 


Asn 
840 


TrD 


Val 


Gin 


Met 


Ala 
845 


Ala 


Asn 


Lvs 


Thr 


Tyr 
850 


j_j y o 




Tvr 
j.y x 


Tvr 


Ser 
855 


Asp 


Glu 


Val 


Ara 


Ala 
860 


Leu 


Ala 


Ara 


Ser 


Met 


Ala 


Glu 


Ala 


Lys 


Glu 


He 


His 


Lys 


Thr 


Leu 


Glu 


Glu 


Asp 


Leu 


Lys 


865 










870 










875 










880 


Asn Arg 


u u 




Gin 


Lys 


Phe 


Asp 


Ala 


His 


Tvr 


Asn 


Thr 


He 


Trp 


Met 










885 










890 










895 




Pro 


Thr 


He 


Gin 
900 


Ala 


He 


Ser 


Asn 


He 
905 


Asp 


Cvs 


Leu 


Leu 


Ala 
910 


lie 


Thr 


Arg 


Thr 


iD C -L 

915 


Glu 


Tvr 

y 


Leu 


Glv 


Ala 
920 


Pro 


Ser 


Cvs 


Ara 

rix y 


Pro 
925 


Thr 


lie 


Val 


Asp 


Glu 
930 


Val 


Asp 


Ser 


Lys 


Thr 
935 


Asn 


Thr 


Gin 


Leu 


Asn 
940 


Glv 


Phe 


Leu 


Lvs 


Phe 


Lys 


Ser 


Leu 


Atct 
rtxy 


His 


Pro 


Cys 


Phe 


Asn 


Leu 


Glv 

«x y 


Ala 


Thr 


Thr 


Ala 


945 










950 










955 










960 


Lys 


Asp 


Phe 


He 


Pro 
965 


Asn 


Asp 


He 


Glu 


Leu 
970 


Glv 


Lys 


Glu 


Gin 


Pro 
975 


Ara 


Leu 


Gly 


Leu 


Leu 
980 


Thr 


Gly 


Ala 


Asn 


Ala 
985 


Ala 


Gly 


Lys 


Ser 


Thr 
990 


lie 


Leu 


Arg 


Met 


Ala 


Cys 


He 


Ala 


Val 


lie 


Met 


Ala 


Gin 


Met 


Gly 


Cys 


Tyr 


Val 






995 










1000 








1005 






Pro 


Cys 


Glu 


Ser 


Ala 


Val 


Leu 


Thr 


Pro 


lie 


Asp 


Arg 


He 


Met 


Thr 


Arg 




1010 








1015 








1020 








Leu Gly Ala Asn 


Asp 


Asn 


He 


Met 


Gin 


Gly 


Lys 


Ser 


Thr 


Phe 


Phe 


Val 


1025 








1030 








1035 








104i 


Glu 


Leu 


Ala 


Glu 


Thr 


Lys 


Lys 


He 


Leu 


Asp 


Met 


Ala 


Thr 


Asn 


Arg 


Ser 










1045 








1050 








1055 


Leu 


Leu 


Val 


Val 


Asp 


Glu 


Leu 


Gly Arg 


Gly Gly Ser 


Ser 


Ser 


Asp Gly 








1060 








1065 








1070 




Phe 


Ala 


He 


Ala 


Glu 


Ser 


Val 


Leu 


His 


His 


Val 


Ala 


Thr 


His 


He 


Gin 






1075 








1080 








1085 






Ser 


Leu Gly Phe 


Phe 


Ala 


Thr 


His 


Tyr Gly Thr 


Leu 


Ala 


Ser 


Ser 


Phe 




1090 








1095 








1100 








Lys 


His 


His 


Pro 


Gin 


Val 


Arg 


Pro 


Leu 


Lys 


Met 


Ser 


He 


Leu 


Val 


Asp 


1105 








1110 








1115 








112 


Glu Ala Thr Arg Asn Val 


Thr 


Phe 


Leu 


Tyr 


Lys 


Met 


Leu 


Glu Gly Gin 










1125 








1130 








1135 


Ser 


Glu Gly 


Ser 


Phe 


Gly Met 


His 


Val 


Ala 


Ser 


Met 


Cys 


Gly 


lie 


Ser 








1140 








1145 








1150 




Lys 


Glu 


He 


He 


Asp 


Asn 


Ala 


Gin 


He 


Ala Ala Asp Asn Leu Glu 


His 






1155 








1160 








1165 






Thr Ser Arg Leu Val 


Lys Glu Arg Asp 


Leu 


Ala 


Ala 


Asn 


Asn 


Leu 


Asn 




1170 








1175 








1180 








Gly Glu Val 


Val 


Ser 


Val 


Pro Gly Gly Leu Gin 


Ser 


Asp 


Phe 


Val 


Arg 



1185 1190 1195 1200 



1 



He Ala Tyr Gly Asp Gly Leu Lys Asn Thr Lys Leu Gly Ser Gly Glu 

1205 1210 1215 

Gly Val Leu Asn Tyr Asp Trp Asn He Lys Arg Asn Val Leu Lys Ser 

1220 1225 1230 

Leu Phe Ser He He Asp Asp Leu Gin Ser 
1235 1240 



